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introduction 

Micrul'nity'i /eus Architecture describes general purpose pcoccsaor, memory, snd interface 
subsystems, organized to operate at the enormously high btndwk th rates required hx 
hn ladhand appbcan< »ns. 

"Jlie /cut processor reforms integer, floating point, signal proceLiing and non-bnear 
opcranont tuch as Galois field, table lookup and hit twitching on data sixes from 1 bit to 
128 bits. Group or SIMD (single in it rue Don multiple data) operations sustain external 
operand bandwidth rates up to 512 bits (i.e., up to four lA-btt operand groups) per 
instruction even or. data items of small sue. The processor performs cnsemH: operations 
such at convolution that maintain full intermediate precision with aggregate tnccrrsai operand 
bandwidth rates up to 20,000 bits per instruction. The processor perform* wide operations 
such as crossbar switch, matrix multiply and table lookup that use caches irnbedded in the 
execution units themselves to extend operands to as much aa 32768 bit*. All kutrucriona 
pmdivt* at moat a single 128- bit register result, source at moat three 128-bit registers and are 
fu% of side effect such as the setting of condition codes and flags. The mstrucbon set 
design carries the concept of streamlining beyond Reduced Instruction Set Computer (R1SQ 
architectures, to umpfafy imrUcrncntaooos that issue several instructions per machine cycle. 

The /cut memory tuhtystem provides 64- bit wtu^ and physical addressing for UNIX, 
Mach, and other advanced OS environments. Separate address tmtrucoons enable the 
div ision of the processor into decoupled accest and execution units, to reduce the crfectrve 
latency of memory to the pipeline. The Zeus cache supplies the high dam and instruction 
issue rates of the processor, and supports coherency primitives for scakable imilnpntccssors. 
The memory subsystem includes mechanisms for sustaining high data rules not only in bock 
anafct modes, but also in non-unit stride and scstterred access parvrna. 

The /cus interface subsystem is designed to match indu«ny staiMiard "Socket T protocols 
and pin-outs. In this way, /cus can make use of the unmciisc mfrastrucrurc of the PC W 
building low-cost systems. The interface subsystem it modular, and can be replaced with 
appropriate protocols and pui-outs for lowrt cost and higher- pcrf irmancc lystc.is. 

The pMl of the /cus arch»tccf;.c it to integrate these processor, memory, and interface 
capabilities with opcimal •:.itplicity and generality. Fmm the software perspective, the enrire 
machine itate cim-..»o of a program counter, a tingle hank of 64 general-purpose 128-bit 
rcgutcn. •nJ a hncar bvtc- addressed shared memofy space with mapped interface registers. 
All Uiicrrupts and csccpnons are precise, and occur with k»w overhead 

Thu document is intended fur /cus software and hardware developers alike, and defines the 
••nrrface at which their designs muvt meet, /cus pursues the most efficient tradeoffs 
iKtwccn hardware and software complexity by making all processor, memory, and interface 
resources directly accessible to high-level language programs. 
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C onformance 

To ci sure that Zeus systems may frvcly interchange data. user level programs, system level 
progra'ns and interface devices, the Zeus system architecture trachea above the processor 
kvel at hitccturc. 

Mandatory and Optional Areas 

A computer system conforms to the requirements of tbv Zeus System Architecture if and 
only if it implements rJl the specifications described in this document and other 
sprafkaoom included i*y reference. Conformance to the apebflcabon is mandatory in aO 
areas, including the instruction set memory management system, interface devices and 
extern) interfaces, and bootstrap ROM functional retirements, except where explicit 
opbont are stated. 

Optional areas include: 

Number of processor threads 
Size of first- <cvcl cache memories 
Existence of a second-level cache 
Stxc of sccotvl-level cache memory 
Stxc of system level memory 

Existence of certain optional interface device interfaces 

Upward-compatible Modifications 

From time to nme, MicroL'nity may modify the architecture in an upward compatible 
manner, such as by the addition of new instructions, definition of reserved bits in system 
state, or addroon of new standard interfaces. Such rmxlificabons will be added as options, so 
that designs that conform to this version of the architecture will conform to future, modified 
versions. 

Additional devices and interfaces, not covered by this standard may be added in specified 
rcpuna of the physical memory apace, provided that system reset places these dc\HX« and 
interfaces tn an inacmr state that does nut interfere with the operation of software that run* 
m any conformant system. 'Ihe software interface requirements of any such addition 4) 
devices and interfaces must lie made as wide))' available as this architecture specification. 

Promotion of Optional Features 

It is most strongly recommended that such optional instructions, state or interfaces be 
impleme n ted in aO cimforminK designs. Such implementations enhance the value of the 
features in particular and the architecture -1 a uholc by hnudening the set of 
imptcmcntattiina over which software may depenu upon the presence of these features. 

Implementation that fall to implement these features may encounter unacceptable levels of 
overhead when attempting to emulatr the features by exception handlers or u>c of virtual 
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Conformance 



memory. This is a parocular concern when involved in code that has reaJ-nmc per fo r ma nce 
constraints. 

In order that upward<ornpatibk optional ex tensions of the original Zeus system 
architecture may be relied upon by system and application software, MkroUnity may upon 
occasion promote optional features to mandatory conformance for tmplcmcntatkms 
designed or produced after a suitable delay upon such notification by publication of future 
version of the specification. 

Unrestricted Physical Impl ementation 

Nothing m this specification should be construed to brat the implementation chokes of the 
conforming system beyond the specific requirements seated herein. In particular, a computer 
system may conform to the /-cut System Architecture while employing any number of 
components, dissipate any amount of heat, require any special environmental fadBdea. or be 
of any physical sac. 

Draft Version 

This document is a draft version of the architectural specification. In dm form, conformance 
to this docunxnt may not be claimed or implied. MkroUnity may change this spedficadon 
at any time, in any manner, until it has been declared final When this document has been 
declared final, the only chanters will be to correct bugs, defects or deficiencies, and so add 
upward c< impanble opoc »nal eaten?* «ts. 
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Notation 



The dc t c rip dvt nocatkm wed m this document u summirized in the table beievw: 



«♦ y 


two's complement addition of x and y. Result is the same size 
as the operands, and operands mufl be of equal size 


x • y 


two's complement subtraction of y from x Result is the same 
size as the operands, and operands must be of equal size. 


x «y 


two's complement multiplication of x and y. Result is the *ame 
size as the operands, and operands must be of equal size. 


x/y 


two's complement division of x by y. Result b the same size 
as the operandi and operands must be of equal size. 


x & y 


bitwise and of x and y. Result Is same size as the operands, 
ana operanas must be or equal sire. 


If 1 14 

* 1 y 


Ditwtse or or x and y. Result is same size as the operands, 
and operands must be of equal size 


« • ta 

* y 


bitwise exdusive-of of x and y. Result is same size as the 
operands, and operands must be of equal size. 




bitwise Inversion of x Result is same size as the operand. 


*.y 


two's complement equality comparison between x and y. j 
Result is a sinqle bit, and operands must be of equal size. 


x # y 


two's complement inequality comparison between x and y. 
Resift is a single bit and ooerands must be of eouat sJ/e. 


x c y 


cwus compicmeni pess man comparison between x and y. 

ncxii D a >RKfC oru arKJ OpcfafKJj mUSt De Or eOUdl Site. 




iwo» iwnpcmcni greater man or equal comparison between 
x and y. Result is a single bit and operands must be of equal 
size. 




floating-point square root of x 


x 1 1 y 


concatenation of bit Held x to left of b* field y 


x* 


binary digit x repeated, concatenated y times. Sire of result is 
y. 




extraction of bit y (using JittJc-cndian bit numbering) from 
value x Result is a single bit. 


*y-* 


extraction of bit field formed from bits y through z of value 
x. Sire of result is v-*H; If z>y. result is an empty strinq. 


xTya 


value of y. if x is true, otherwise value of z. Value of x i« a 
sinqle bit 


x y 


bitwise assignment of x to value of y 


Sn 


Honed, two's complement binary data format of n bytes H| 


Un 


unsigned binary data format of n bytes 


Fn 


floating-point data format of n bytes 



descriptive notation 
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Bit ordering 

The ordering of bits in this document b always hrtk-enaian, regardless of the ordering of 
bytes within larger data structures. Thus, the lew-significant bit of a data structure is always 
labeled 0 (zero), and the most-significant bit it M>drd as the data structure size (m bits) 
minus one. 

Memory 

Zeus memory is an array of 2** bytes, without a specified byte ordering, which is physical)) 
distributed among various components. 

7 0 

byte 0 
byte I 
byte 2 



* 

byte 

8 

Bvte 

A byte is a single element of the memory array, consisting of 8 bits: 

7 0 

I byte I 
8 

Byte prdennfl 

larger dau structures arc constructed from the concatenation of hyti* in either little-endian 
«* big-endian byte ordering. A memory access of a data structure of size s at address i is 
formed from memory bytes at addresses t through i+s-1. l ? iucss otherwise specified, there is 
no specific requirement of alignment it a not generally required that i be a multiple of s. 
Aligned accesses are preferred whenever possible, however, as they wiD often require one 
fewer pmces** or memory clock cycle than unaligned accesses. 

VTtth little-endian byte ordering, the bytes are arranged as: 

f8Q 1*8-8 IS 8 7 0 

I byte t+s-l | ... 1 byte Kl \ byte T \ 

8 8 8 
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With btg-endian byte <«dcring. the bytes are arranged as: 

«2J gjnM t-g.ia, 7 0 

I byte I i bvt. u i | | bvt. jgTH 

- i__ —J 

Zeus memory it bytc-addreaaed, using other hnJe-endian t» big-endian byte ordcrtnu. Pur 
wmwrncy with the bit orient*, and fur compatibility with .86 pmcwaon. Zeus «•« httle 

weatnukble for both hnle-endian and big-endian b,te ordering. The sclccttm of b>ie 
ordenng u dynamo i» that little-endian and b*-e«dun pmcesaea. and even data srnjeturcs 
wtthm a process, can be intermixed on the prt<e»a.K. 

Memory rr»* d/|oad semantfr e 

Ze us memory. mJud* g tnernury-mapprd legmen, must conform to the fijowtng 
tetfuuementi regarding ndr -effects of read or load operations: 

Amctnory read moat have no aide-effects «w the ctmtents of the addrcaaed memory nor on 
the contents of any other memory. 

Memory writc/stor*. <»m*ntfr« 

Ze us mem ory. ****** memory mapped registers, must conform to the Mowing 
teauireme»« itgarimg atdc-«ffecta of read or load operations: 



A rT^7 *** «*««»«» «'f «Ke sddressed memory at. that a mcm.« read 

ol the addressed memory retuma the appropriate portion of the value wnrtcn. 

A memory write may affect or cause aide-effecta on the contents of memory r»« addreased 
by*e wme operation, however, s second memor) write of the same value to the same 
ridt«mu« have no side«ffcet» on any mem,-,, memory write operabon. muu be 
•ocmpuccnc 

Zeus smc imtmctkms that are weak), ordered may have tide-effects on the con.ents of 
™™*] » dd «««« °) 'He .tore itself; subvruuent U.mI instructions which are alao weak* 
ordered may or may mm return values which reflect the ndeefTeeta. 

Data 

and nsteen-bytc (12S-oit) d.r. path and u« fixcdlcn^ iW-bytc 12-b.t 

SSJTfTf .t^ardTM-^T'T 11 ^>'* <>™*™ unsigned binary and 
A.N51/IF.F.F. standard 7*4- I9H5 enfomung binary floaong point number representation.. 
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Fixed-potnt Data 



A bit ib i primitive data clement: 



rCCK 

A pc*> ts the catenation of two bits: 



0 

B 

i 



t o 

gag 



A nibble ts the catenation ot knu btta: 

J 0 



A b\n ts the catcnar* «\ of c?*ht bits, and is a single element of the memory array: 

7 0 



A doublet is the catenation of 16 btta. and it the catenation of two by**: 
«* 0 

L doublet | 



16 



A ifuadlct is the catenation of te bits, and is the catenation of &*»r b)tea: 
3) 

I qcmdlet 



32 
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An octfct m the catenatkm of 64 bio. and a the ottnaain of eyht byte*. 



?2 



31 ; 

r <>cti«t3t..o' 



A hcxlct 0 the CMceum of 128 fata, am) it the catenaaun t4 sateen bytev 

t27 ^ 

jj J 



32 

31 

I ^<«t31."o~ 

32 



13 



MtcrolJniiy 



Zeus System Architecture 



Tuc.Augl7. 199V 



Gimmnn Elements 



Trfclet 

A triclct is the catenation of 256 bra, and t» the caicmtian of thirty-two bytes 

jgs 



224 



223 



tf»d*2SS..224 



37 



tf»cUt22J,lt2 1 



12 



191 

I «r»cltiti..t60" 



140 



32 



IS9 



I2t 



32 



12? 

i jjjcjjtiisr 



IT 

I tr«clt»s..44 

32 

63 

I trlcl«43..32 

32 

31 

I trtcftai.O 

32 



37 



Address 

/cu» addresses, l**h virtual addresses and phy^caJ sddresaes, M ,«Wt quantities. 

Floating-po^f I fr ifr 

Zois's floatm* p.*m brnwt are dextfned n, MO sfy ANSI/1RRF. uxxU 754-19*5: Binary 
FVtttmg-puint Arithmetic Standard 754 leaves certain aspects to the discretion of 
implcmcntcti: additional precision forms n, encoding of quiet and signaling SUN values, 
details of production snd pn<pa K ation of uuhi NaN \ alucv These aspects arc detailed below. 

Zeus sdds additional half precision and <fu»d^xrcision formats to standard 754's swgle- 
ptrcwnn and double-precision formats. Zeus's double- prtosaai satisfies standard 754"* 
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prccwm MnjuiirmcMi for * xin^c extendi l„nw.. and /,«,-. ^ pmtM<<0 %Mu(k% 
standard S4 » precision requ..cmc,.i» tor « douhlc extended lomui 

K-Kh precision formal employs field. I^W . («^n). « (exponent). 4ml f (fnc**) *, 
encode value, that arc (1) NaN: uuwt and ai^ahi* (>) mnnincs (!)*•-. (!) norrnalucd 
number, (.i^.ja. ^ (4j „ umhm: ( |)A , 2A , ^ fl ^ _ 

Quiet NaN value, art denoted bv am s.^n ha value, an exponent field of all ,.nc hit* and a 
n..n ,cm fraction w:th the most ..^.ficant hit xe. (^.-t NaN value. ,{cncr*icd hv .kuuli 
vxcc*n..n handl.n K of standard opcran.-ns have a /cm , 1K n b,i. an exponent held of dl ,. IK 
nm. a fraction he i with the most stpuficant bit set. and aU other bits cleared 

S xruhr* NaN value, are denoted by any WK n bit vaU. an eaponcnt field ofaU.^e bra, and 
a nonacn. fraction with the mat significant tut cleared. 

Infinite valuta are denoted by any sign bit value, an eapuncnt field of all oneh.tx.aod. .en, 
t racoon field 

N«.rmab*cd number value, ate denized by any ugn bit value, an caponcnt field that u not a!» 
«wc tun <« aB wit, bita. and any tract** field value. The nut eric value encoded ia 
( IjAa^ h^, 0 !V biaa U «iual the value re.ulonx »emn K all but the most 
significant bit «f the capuncnt field, haf IS, .ingle: 127. double: I02A. and uu*l 16.W.V 

Dcnorrnah.ed number value, are denoted by any .ip* bit value, an exponent field that » aO 
kto kta, and a mm ten > fraction field value. The numenc value encoded 1. M)A.vM 
•"-(HI). 

Z*ii> value, are denoted by any sign bit value, and exponent field that is «0 »cn, bits, and a 
fraco.* ficU that is aO aen, bits. TW numeric value encoded i. ( \)*H). The dianncn.»n 
between *0 and -0 is aigruficani in Mime operations 

Hatf QTtXhton FtaarJng.pninr 

/eua I . »f rmiM imi a Hmtui similar h. st.ndaul 7W, reu.u.rcmcnts, rcduc-d to » It. b« 
overaB format The format contains sufficient pricision and exponent range 10 hold a 12 bit 

t M 109 0 

1 s io 1 
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C<«r.nv»n hlcmcnt* 



SiOQgc-ijfectstQn EteflBOaJMint 

/cut tinglr prrciMM wtunct tut* ud *S4*s r» ^uirrmcnn ft* "single." 

J JO 2J22 0 

IB 2i 
Ujuldr Ofcoitoo Floating oo-nt 

/cut iiiK*hlv piiCUa.n !Mt:»tir«t mmUrii N\ rc^uirrmrim tor "duubk." 



6 62 Si*: J2 

i 



M 


• 


i 


ff!..32 J 


1 








11 0 








1 '31.0 I 
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/cut t|uMl prrcitmn viftstui standard 'M'i requirement* for "double extended ' but has 
addin»na] tract* m prciii»>n m uae I2H htt*. 



12 126 


Pi III 




96 


i.i 


1 


f?11..9A 


1 


1 IS 




16 




9S 






64 






32 






63 






32 


1 '63.32 1 




32 






31 






0 


1 C3I..0 I 
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Zeus Processor 

Mscrol nm'i /cut processor pnivturs fhr general purpose, h** Undu-Jth ciimrnitatkm 
r -inability of the Xxus intern Zeus includes hi^h har*rwidth data pathv register files, and a 
memory hierarchy, /rust rrwm«>n hierarchy includes on chip tnsrruct»m and da a 
rnemoacs, r*srrucnon and dara caches, a virtual memory facibty, and interfaces to externa] 
device*. Zcuss interfaces in the initial implementation are solely the '•Super Socket 7 M hui, 
but other implementations may have different additional interfaces. 

Architectural Framework 

*llu- Zeus arihitecture defines a compatible IramcutirL for a family of implementations with 
a ran<c of capabilities. The nilk»v ing implement* Don denned parameters arc used in the rest 
of the documen! in boldface The value indicated is for Micmlwty's first Zrm 
implementauon. 



Pa*ame 
ter 


Interpretation 


Value 


Range of legal values 


T 


number of execution threads 


4 


1 * T S 31 


CE 


logj cache blocks in first-level 
cache 


9 


0 S CE S 31 


CS 


iog2 cache wocki m Hrst-level 
cache set 


2 


0 $ CJ S 4 


CT 


existence of dedicated taot »n 
first-level cache 


1 


OS CTS 1 


LE 


togj entries in local CB 


0 


0 S LE S 3 


LB 


Local TB based on base register 


1 


0 S LB S 1 1 


GE 


1092 entries in global TB 


7 


0 S GE S 15 


GT 


)OQ2 threads which share a 
global TB 


1 


0S6TS3 



Interfaces and Block Diagram 

The first implementation of Zeus uses -socket ? M protocols and pinours. 

Instruction 

Instructions are specified hi Zeus assemblers und other code tools (assembler*) in the syntax 
of an instruction mnemonic (operation code), then optionally white space (blanks or rd») 
f«»Uowcd b) a list of operandi. 

The instrucnon mnemonics listed in this specification are in upper case (capital) letters, 
assemblers accept either upper case or lower case leiiers in the instruction mnemonics. In 
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thif specification, tnstrucuon mncmunict contain periods f\") to separate dementi to, make 
them easier to understand; assembler* ignore periods within instruction mnemonica. The 
instruction mnemonics are designed to be parsed uniquely without the separating periods. 

If the instruction produces a register result, this operand is listed first Following this 
operand, if there are «kw m »norc source operands, is a separator which may he a cumm* 
a* tx^n {**(<£"). The ctfua) separates the result operand from the unxree 
operands, and may i>pnonalh be cxprvnacd as a comma in assembler code. 'IV at sigr 
indicates that the result operand ia also a source operand, and may opnonaJ!) be expressed as 
a comma in assembler aide. If the instruction specification has an equal-sign, .n at sign in 
assembler code indicates that the result operand should be repeated aa the first source 
operand (for sample, "A.ADD.I r4@5~ ti eouivaJcnt to "A.ADDJ t4*i4.S H ) % Gmrrns 
always separate the remaining source operands. 

The result and source operands are case-sensitive; upper case and lower case letters are 
distinct. Re nter operands are specified by the names rO (or rOfl) through rf>3 (a rower case 
V immctLixh folkiwcdMn a one or two digit number from 0 to 63), or by the special 
designations of for -dp- for *rl." *rp M ft* "r62, M and "sp" for "i6\ H Integer- 
valued operands are specified by an optional sign (-) or (♦) Mowed by a number, and 
assemblers generally accept a variety of integer-valued expressions, 

(Pttri ttiQn Structure 

A Zeus instrucoon is specificaD) defined as a four-byte structure with the hroc-endxan 
ordering shown bekiw. It is different frsm the ouadlct defined above because the placement 
of instructions into memory must be independent of the byte ordering used for data 
structures. Instructions must be aligned on four-byte boundaries; in the diagram below, i 
must be a multiple *»f 4. 

31 2* 23 16 IS 87 0 

I byte f+3 I byte t>2 | byte j+J | byte I I 

8 8 8 8 

Gateway 

A Zeus gateway is specifically defined as an 8 -byte structure with the litdc-endian ordering 
shown below! A gateway contains a cinie address used to securely invoke a system call or 
procedure at a higher privilege level. Gateways are marked by pmtccbon information 
specified in the TB. Gateways must be aligned on 8-byte boundaries; in the diagram hcknv, i 
must be a multiple of & 

« S6 SS 48 47 40 39 32 

I byte 1+7 I byte j*6 I byte f>S | byte t>4 I 
* 5 8 8 

3j 24 23 16 IS 87 p 

| byte 1*3 | byte t>2 I byte I+l | byte I | 
8 8 8 8 
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1 nc gateway contains two data itcma within its irmctutt. a etide address and a new privilege 
level: 

** 21 0 

I code address | p t | 

2 

The virtual memory system can be used to designate a region of memory as containing 
gateways. Other data may be placed within the gatew ay region, provided that if an attempt is 
made to use the additional data as a gateway, that vcunty cannot h. violated. Tor example. 
64-bit data €* stack pointers which are aligned to at lea?t 4 bytes and are in httlc endun byte 
order have pl^O, so that the privilege level cannot be raised by artrmpring to use the 
additional data as a gateway. 

Uqr Statq 

The uscf state consists of hardware data structures that are accessible to all conventional 
compiled code. The Zeus user state is designed to be as regular aa possible, and consists only 
of die general registers* the program counter, and virtual memory. There are no specialized 
register* me condition codes, operating modes, rounding modes, integer multiply/divide, or 
floating-point values. 



General 



Zeus user state includes 64 general reyster*. AO arc identical; there is no dedicated xen>- 
vaJucd register, and there arc no JvUicatcd floating-point registers. 

t27 o 



REGfO[ 



WE Of 11 



REGI21 



KEGf62| 



128 



Some Zetis irutructiuiis have 64 bit remitter operands. These tipcrandi arc sign-extended to 
128 bita when written to the reciter file, and the low -order 64 bit* »t chosen when read 
from the register fJr 



def vM «- RecjAe»d|ra . *c( 
cave sire of 
64: 

val «-» RCG|m| 6 | o 

128: 
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vjf RfGfrnj 

endtAse 

vgAfritrfra we. v*4 
tt/C Of 

R£G|rn| #- va^j 64 I I w^j 0 

endune 



Program Counter 

The program counter contains the address of the curtmdy executing instruction. This 
register is implicitly manipulated by branch tnsmjcooru, and read by branch instructions that 
save a return addrest in a general register. 

*2 2 10 

t rVoqramCcHjnter |0| 

*2 2 



Privilege levc» 

Tt* privilege level register contains the privilege level of the currently executing instruction. 
This register it impbctdy rnantpularcd by branch gateway and branch down msrructiom, and 
read by branch gateway instructions that save a return address in a general register. 

10 
2 



Program Counter and Privilege Level 

The program counter and pnvilege level may be packed into a stryle octlet. This combined 
data structure is saved by the Branch Gateway instruction and restored by the Branch Down 
instruction. 

*L 2 10 

I troQT* mCounter Ej 

62 2 

System sttte 

The system state consists of the facilities not normally used by conventional compOed code. 
These facibnes provvJc mechanisms to execute such code in a fully virtual environment AD 
systrm sr-iic is memory mapped. so that it can be manipulated by compiled code. 
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Fixed-ppint 

Zeus provides load and store irumxoon* to move data between mcmorv and the registers, 
branch instructions to compare the contents of registers and to transfer contml from one 
code address to another, and arithmetic operations to pcrfiirm computation on the contents 
of register*, returning the result to registers. 

Load and Stog 

The load and smre instructums move data between manor)' and the registers. When loading 
data from memory into a register, values are zero-extended or sign-extended to nil the 
regis tcr^VThcn storing data from a register into memory, values arc truncated on the left to 
fit the specified memory region, 

IVad and store mstructiorw that specify a memory region of more than one byte may use 
other btrJc-cndun or big-endian byte ordering: the size and ordering are explicitly specified 
in the matiuctM*. Regions larger than one byte may be either aligned to addresses that are sn 
even multiple of the size of the region or of unspecified alignment: alignment checking is 
abo cxpbarJy specified in the instruction. 

Load and store batructiora specify memory addresses as the sum of a base general register 
andthc product of the size of the memory region and cither an immediate value or another 
general register. Scaling maximizes the memory space u.iich can be reached by immediate 
offsets hom a single base general register, and assists in generating memory addresses within 
iterative loops. Alignment of the address can be reduced to checking the alignment of the 
first general register 

The load and store instructions arc used for fixjcd-rx>int data as well as floating-point and 
digital signal processing data; Zeus has a single bank of registers for all data types. 

Swap instructions provide multithrerd and multiprocessor synchronization, using indivisible 

ctpenriona: add swap, compare- .wap, multiplex-swap, and double-compare- swap. A store- 

multiplex operation provides t»ie ability to indivisibly write to a portion of an octlct. These 

immjroons ****** operate on aligned octlct data, using cither little-endian or big-endian 
byte ordering. 

The fixed-point compaxc-and branch instrucbons provide all arithmetic tests for equality and 
ineoualiry of signed and unsigned fixed-point values. Tests arc performed either between rwo 
operands contained in general registers, or on the bitwise and of two operands. Depending 
on the result of the compare, either a branch is taken, or not taken. A taken branch causes an 
immediate transfer of the program counter to the target of the branch, specified hy a 12 bit 
signed offset from the location of the branch instruction. A non- taken branch r-.isc- no 
transfer, execution continues with the following instruction. 

Other branch itorrucrkms provide for unci mdi bona) transfer of control to addresses too 
distant to be reached by a 12-bit offset, and to transfer to a target while placing the location 
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foDovmg the branch into a register. The branch through gateway mstrucbun provides a 
sccive means to access code at a higher privilege level, in a form sirrahr to a normal 
procedure call 

Accessing Operations 

A subset (if general fixed point arithmetic operations is available as addressing operations 
These include add. subtract. Boolean, and simple shift opcran.imv Inese adc'rcssii^ 
operations may be performed at a pcant in the Zeus pniccssor pipeline so that they may be 
ample ted prior to or in conjunction with the execution of load and store operations in a 
"supcrspring" pipeline in which other anthmeoc operations arc deferred until the 
completion of load and store operations. 

Execution Operations 

Many of the operations used for Digital Signal Processing (DSP), which are des cri bed in 
greater detail below, are also used for performing simple scalar operatsons. These opc^oons 
perform anthmctic operations on values of ft-, 16-. J2-. 64-, or 128- bh sacs, which ate 
right-aligned in registers. These execution operations include the add, subtract, boolean and 
simple shirt operations which arc also available as addressing o perations, but further attend 
the available act to include three-operand add/subtract, mice-operand boolean, dynamic 
shifts, and bit field cjperaoons. 

flQc3ting>pQint 

Zeus provides aD the facilities mandated and recommended by ANSI/1 ERE standard 754- 
1985: Binary Roaring- p<ju*t Arithmetic, with the use of supporting software. 

Branch Conditionally 

The floating point compare- and* branch instructions provide all the comparison types 
required and suggested by the IEEE floating-point standard These floating-point 
comparisons augment the usual types of numeric value comjnrison* with special handling 
ft* NaN (not a- number) values, A NaN value compares as *"unorucrcd~ with respect to any 
other value, even that of an identical NaN value. 

Zeus floating-point compare-branch instructions do not generate an exception on 
comparisons involving quiet or signaling NaN values. If such exceptions are desired, they 
can be obtained by combining the use of a floatingpoint compare-set ins true So, with 
either a floating-point compare-branch instruction on the floating-point operands or a fixed- 
point compare- branch on the set result. 

Because the less and greater relations arc anti-commutative, one of each relation that differs 
from another only by the replacement of an L with a G in the code can be removed by 
reversing the order of the operands and using the other code. Thus, an L relation can be 
used in place of a G relation by swapping the operands to the a>mpare-branch or compare 
set instruction. 
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Nm) Mwauctkw art provided that branch when the values are ur* irdrrcd To accomplish 
such an operation, use the reverse condition to branch over an trnmcdutety following 
unciiocfa txinsJ branch, or in the case of an if then else clause, reverse the dausea and use the 
reverse condition. 

The E relation can be used to determine the unordered condition if a single operand by 
amparing the operand with itself. 

The Mkmnng ftoaontf p*«nt compare branch relations are provided as instructions: 



Mner 


nonfc 1 B-anch taken tf values com 


pare as. 


Exception if 


code 


C-Wce 


•jnord- 
ered 


Greater 


Leu 


Equal 


unord- 
ered 


invalid 


E 




F 


F 


F 


T 


no 


no 


LG 


o 


F 


T 


T 


F 


no 


no 


I 


< 


F 


F 


T 


F 


no 


no 


GE 




F 


T 


F 


T 




no 



Comparer 

T>eaim|Mre-aet floatingpoint imtrucoons provide all the cornpariaon types supported 
branch insmicttons. Zeus compare-set filiating point instructions may opoonaUy generate 
exception on comparisons involving quiet or signaling NaNa. 

Thcfoaowtng nWing-paint compare- set relations are provided as instructions: 



as 

an 





nonic 


Result if values comoarc as. 


Exception If 


code 


C4ike 


Unord- 
ered 


Greater 


Less 


Equal 


unord- 
ered 


invalid 


E 








F 




no 


no 


LG 


o 










no 


no 


L 


< 










no 


no 


GE 


>m 










no 


no 


EX 












no 


yes 


LGX 


O 










no 




LX 


< \ 










yes \ 




GE* 


<» 










yes 


yes 



/Vftthmetic Operations 

The basic operations supported in hardware are floating point add, subtract, multiply. d»vtdc. 
Kpajc root and conversions among floarin K point f.,nnats and between floating point and 
binary integer formats. 

Software libraries provide other operations retired by the ANSI/IF.F.F floatingpoint 
standard. 
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The operations expbady specify the precision of the operatian. and round the result (or 
chccH that the mult is exact) So the specified precision at the conclusion of each operation. 
Rach of the bask operations splits operand register* into symbols of .he specified precision 
and perfcjrms the same operation on corresponding symbols. 

In addition to the basic operations, Zeus pern httis s variety of operations in which one or 
more pnxlucts are S* .Timed to each other and /or to an additional operand. The instructions 

include a fus.d multiply add (R.MVLAOD.F), c involve fR.CON-0. mama multiply 
(K Ml l.MAT.F), and scale-add (F..SCAI.ADO.F). 

The results of these operations arc computed as if the multiplies are perf o rmed to infinite 
precision, added as if in infinite precision, then munded only once, Consequent)), these 
operations perforin these operations with no nninding of intermediate results that would 
hive limited the accuracy of the result 

(touring and exceptions 

Rounding is specified within the instructions explicitly, to avoid explicit state r egister s foe a 
rounding mode. Similar!), the instructions explicitly specify how standard exceptions (invalid 
operation, &vu> *• by sco, overflow, underflow and inexact) art to be handfed. 1 

Vfhcn no rourding is explicitly na*red by the instruction (dcrVih), rouod so nearest rcwnJng 
is performed, and all ft «ting-potnt exception signals cause the standard- specified default 
result, rather than a trap. When rounding is cxpbcity named by the instruction (N: nearest, Z. 
rcro, F: floor, C ceiling), the specified rounding is performed, and flea ong~ point excepbon 
signals other than inexact cause a floating-point exception trap. When X (exact, or 
exception) is specified, ail floaang- point excepbon signals cause a floating-point execp-ion 
trap, including inexact. 

This technique assists the Zeus processor in executing t> iting- point operations with greater 
parallelism. When default rounding and exception handling control is specified in floating- 
point instructions, Zeus may safely retire instructions roOowing them, as they are guaranteed 
not to cause data-dependent exceptions Similarly, floating point imtrucoons with N, Z, F, 
or C conm>l can be guaranteed not to cause data-dependent exceptions once uSe operands 
have been examined to rule out inv Jid operations, division by zero, overflow or underflow 
exceptions. Only floating r* ant instructions with X conttc t* when exceptions cannot be 
ruled out with N, Z, F, or C con in J need to avoid retiring foDoui.tg irurructiorn u*ml the 
final result u generated. 

ANSI/1RRR standard 754-1985 specific* in£>rrnabon to be giver* to trap handiers for the 
five floanng-point exceptions. The Zeus architecture produces a precise exception, (Hie 
pn>gram counter points to the instruction that caused the exception and aO register sure is 
present) from which all the required information can be produced in software, as all source 
operand values and the specfied operation are •vailablc. 



H' S. IStetM Ml 2.439 describes Hut "Techmqur of mrorporeftn* Ao*tv* potnt mfonmbOA m*> protestor 
mstvueto*.** 
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ANSl/IFFF tundtrti "Va I^S «prcifics a set of five "sticky caccptsim" Uts, for recording 
the occurrence of cicq«*ms that ait handled by default The /cut architecture produces a 
pmnc excepoon tor «mructnm» with N, Z, F, or C cumm) for invalid opera turn, cbvuxm 
by actu, overflow ur uockrAuw cacrpcmcu and with X control ft* ail floating point 
caccpoitu, (turn whsch corrrsr*>ndtr* aocky^occpoun bin can be act Fjurcuuon of the 
same tnamamcKi with default control wi8 compute the derauh rt.'ih with round t^ neareat 
n*jnding. Moat cornr-iund opcramm not specified by the standard are not available with 
mufufcajt caceptaun controls. 

NaN handling 

ANS1/IF.RF. standard %4-l98S specifics that operations involving a signaling NaN or 
mvabd <*^crat»on shall, if mi trap cjccucs and if a floating point rcauh is so be delivered 
dehver a quart NaN aa ica rrsuh. Ilowcvcr, tt tails to specify what quiet NaN value so deliver . 

Zeus opera oom that produce a Hotting- point mult and do nut trap on invalid opcradoru 
propagate signaling NaN vajuca from operands to results, changing the signaling NaN values 
hi quart NaN values by setting the moat significant fraction bit and leaving the remaining 
bita unchanged Other causes of invalid operations produce the default quiet NaN value, 
where the sign bit is aero, the exponent field b aD one bits, the moat significant fraction bit is 
act and the rtmaing fraction bita are aero bita. For Zeus operations that produce multiple 
reauha catenated together, signaling NaN propagation or quiet NaN production is handled 
separately and independently for each rcauh symbol 

ANSI/IFJIE standard 754-1 WS specifics that quiet NaN values should be prtipagatcd from 
operand to result by the bask operations. However, it fails to specify which of several quici 
NaN values to propagate when more than one operand is a quiet NaN. In addition, the 
standard does not clearly specify how quiet NsN should be propagated ft* the multiple 
operation instructions provided in Zeus. The standard does not specify the quiet NaN 
produced as a result of an operand being • «gnaiing NaN when invalid operation exceptions 
are handled by default. The standard leaves unspecified how quiet and signaling NaN values 
m P«W^ *ough format ^inversions and the absolute- value, negate and copy 
operations. This section specifies these aspects left unspecified by the standard 

First of all, for Zeus operations that rmnlucc multiple results catenated together, quiet and 
signaling NaN propagation is handled separately and independently for each result symbol. 
A quiet or signaling NaN value in a single symbol of an operand causci only those result 
symbols that are dependent on that operand symbol's value to he propagated as that quiet 
NaN. Multiple quiet or signaling NaN values in symliols of an tipcrand which influence 
separate symbols of the result are propagated independent!)' of each other. Any signaling 
NaN that is propagated has the high-order fraction hit set to convert it to a quiet NaN. 

Por Zeui operations in which multiple symbols among operands upon which a result symbol 
is dependent arc quiet or signaling NaN;, a priority rule will determine which NaN is 
propagated Priority shall be given to the operand that is specified by a register definition at a 
luwcr numbercd (little-endian) bit position within the instruction (rb has priority over rc, 
which has priority over rd). In the case of operands which arc catenated from two registers] 
priority shall be assigned based im the register which has highest priority (lower- numbc red 
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bit poaraon wtthm the wwtructioo). In the case of be (as when the RSU J.ADD scaW 
operand hat two corresponding NaN values, or when a E.MULCP operand has NaN values 
for both real and imaginary components of a value), the value which is located at s lover 
numbered (hide cndian) bit position within the operand it to receive poonry. The 
idcnnficatv>o of t NaN as quiet or Hgnahng than not confer *ny priority for telecoon - only 
the operand position, though s signaling NaN will cause sn tnvabd operand exception. 

The s.^n hit of N.N values propagated thaJI Ik u» >r.lenvntcd if the instruction subtracts or 
negates the corresponding oxrand or (but not and) muloptiet it by or divides it by o» 
divide, it int., an .^erand * hich hat the sign bit set, even if that operand it another NaN If 
i NaN is both subtracted and multiplied by a negative value, the tign bit thai! be proMnted 
unchanged 1 * * 

l ^"'^ll* cn0 '" n thjt cooven bctw «« two floating-point formats (INFLATE and 
wF.FI .ATE), NaN values are p*>pagatcd by preserving the sign and the moat-aignifkant 
that the mmt-tignificant bit of a tignalhng NaN is tet and (for 
DF.FI.ATE) the least tignincanr fraction bit preserved it combined, via a logkal-or of all 
fraction bitt not preserved. AD additional fraction biw (for INFLATE) are tet to aero. 

!l" ^Jtr**?"* tK * , , C ° nVen from » ^mg-poim fomiat to a fixed- point format 
(5INK), NaN values produce aero values (maxanunvEkelihood estimate). Infinity values 
pnrfuce the target! repretcntahle ptmtive or negative fixed-poult value that fits m the 
detonation field When exception traps are enabled. NaN or Infinity values produce a 
ftwong-pomt exception. I'ndcrflowt do not occur in the SINK operation, they produce -1 
0 or* I. depending on rounding controls. 1 P"»uce i. 

For absolute value, negate, or copy operations, NaN values are propagated with the tign bit 
cleared, complemented, or copied, respectively. Signalling NaN values cause the Invalid 
operation execpoon. propagating a quieted NaN in corresponding tvmbol location! (default! 
or an except***, at tpccificd by the inttmction. 

Floating-point functions 

l-he foUowuig functions are defined for use within the detailed instruction definition, ui the 
following *ccd««. In these function, , n internal format repretentt infinite-precision floating 
point values st a four-element ttructure contitting of (1) . (sign bit): 0 for ,w«itive. 1 for 
neganve. (2) , (ope): NORM. ZERO. SNAN. QNAN. INFINITY. (3) e (exponent), aod (4) 
f: (fraction). The mathematical interpretation of a normal value placet the binary point at the 

Uni L $ "/fill**?""' * d|M1,cd by ,Ke cx P oncnc M) A "(2^*)«f. The function F convent a 
packed IEEE floating-point value into internal format The function PackF convent an 
internal fonnat back into IEEE floating-point format, with rounding and exception control. 

Definition 



def eo »- cMs(prect as 
case pre* of 
16: 

eb «- S 

32: 
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CO «- 0 

64: 

(6 4- It 

128: 

eo 4- IS 



efc «- etMsfprcc) as 



del fb 4- ftMstprtcj as 

lb prec - I - rt> 
cnoocf 

Rprt*. af at 
««-*P"<.| 



tf at * chen 
* af . 0 twn 

aj «- jNFiNrnr 

f K**pr»cH 
ai 4- SNaN 

-lbdb(prcct 

ai *- ONaN 
a* «- •fbestprec) 
a/4- af 



etof ae • 0 then 
#af - 0 then 
ai «- ZERO 

dse 

ai#- NOAM 

a* 4- l-e**as(pre^fbrts(pre<| 
ait- 0 II af 



MA #- NORM 

a.e «- ae~4*Us(prcc|*rb«s(precJ 
ai #- I II Mi 



def a 4- OCMUITONAN as 
aj 4- 0 
ai «- OMAN 

JL€ 4- »| 

ai 4- l 
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def a «- OEFAULTSNVJ as 
« «- 0 
At*- SMAN 
a* «- -I 
ai«~ 1 



if 



def as faddr|a.b.N| enddef 

def < «- fdddrta.txroundj as 

* a UNORM and di-NORM then 
// die are Jib w«h expone nt 
j e > txc then 
d +- a 
ei b.t 
ej «~ b s 
e* ae 
ei bi It 
else S a.e < b.e then 
di 4- ai 

d.e b.e 
di «- ai 1 1 
e b 



afcgned and fraction adjusted 



c.t «- di 
ce 4- d* 
d s ■ ej then 

cj dj 

ci di ♦ ei 
efsetf di > ei then 

C4 «- d J 

ci 4- di • ei 
etsetf di < ei then 

c j +- ej 

ci «- ei ~ di 

else 

c j r*f 
ci ZERO 

end* 

// priority is given to b operand for hUN propagation 
efsetf fb.t-SMVg| or (b.t-OM*gs| then 
c «- b 

rtetf |a*.$W4 or |a*ONAh| men 
c «- a 

eheif jWERO and bWERO then 
C4 «- ZERO 

c J |aj and bs} or (rourx**f and |aj or lu| 
// NUU. values are like wrc. but do not combine w*h ZCRO to after sign 
else* aWERO or at-NUU then 

c «- b 

eheif txuZERO or bJ-NUX then 
c «- a 

efseW a uiNFIMTY and bMNFIMTY then 
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t a s * tu **n 

C OCMULTSNAN // k*aad 

efre 

c #- a 

end* 

else* aMNFAITY then 

c «- a 
cte# biUINFMTY ttn 

c b 

etoe 

assert FALSE // should haw covered at ihe cases above 

end* 
enddtf 

del 6 «~ fhcg|aj as 

txs «- -aj 

fef «- ai 

bie «- a.e 

bi «- ai 
cnoocr 

def fsub|a,b| as feubrfaixM tnddcf 

def fojbrfttxrouncf as Caddrta>*9(bMound| enddef 

def fhubfXb| as IrsubrfaJvM enddef 

def frsubrtaArouncI as faddr|ftieg(aMxfound| enddef 

def c «- fcomJa,b| as 

tf |a.t*SNW| or fa.t»ONAN| or fb t-SMVJ, or |6.t*ONA^ then 
c U 

ebc* aMNFNTY and b-UINFINfTY then 
i aj 0 tu then 

c «- |aj-OJ ? a t 

else 

c «- E 

end* 

etsetf aMNFNIY then 

C (a^OJ > Ge I 
else* ro-lNFlNrTY then 

c «- (tu-0| ?Gl 
else* a^NOAM and txt»NOftM then 

f bj then 

c 4- |aj»OJ 7 <k I 

else 

* a.e > b.e then 
af ai 

bf +- bJ II 0*^ 

else 

af +- a.f 1 1 0**-« 
bf «- bJ 

end* 

*af • bf then 
c 4- E 
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c «~ |a j-0| • |af > trf| ? 0 : L 

end* 

end* 

efserf a^MORM tfien 

e #- |aj-q ? G l 

ehcrf bUNOSM then 

c |bLS-0| ? G L 
erserf J N/f RO and bt»ZERO then 

c «- C 

e*e 

assert FALS£ // should have covered afl the cms *oow 

end* 

>n .-a rj ■ J 

CIMMJCI 

def c +- fmupibf as 

a t-NORM and bt-NORM then 
C4 *- at • bs 

CI «- NORM 

ce a.e ♦ b.e 

< f aJ • bi 
// prtonty a g**i to b operand lor NaN propagation 
ehetf tfxt»SMV4 v (bj-ONA*4 men 

cs as * bs 

ct «- bt 

ce «- be 

ci bi 

etse* Ja t»SJW4 or |a fc-OHVJ, sVn 
cs aj • bs 
ct aj 
c.e a.e 
cf 4- *1 

ehcV Jt-Zf RO and bt*JNFINfTY then 

c DCF^JITSNAN // ftnvaftd 
e*erf j t-JNFINTTY and txt-ZERO then 

C «- DCFAUl TV4AN // ftnvatd 
HW auZEKO or bt-ZERO then 

cs #» aj * cm 

Ct ZERO 

ene 

assert FALS£ // should have covered aJ the cases above 

end* 
enddef 

def c W*rfa.b» as 

4 aUNORM and bt-NORM then 

c j aj • bs 

ct +- NORM 

ce «- a.e - be « 2S6 

ci «- fa.' II 0***| / bf 
// pnonty ts given to b operand lor NUN propagation 
ehe* fb.t»SNAN| or fbt-ONW* men 

c s aj * bis 

ct «- bt 

ce be 
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c f +- 5i 
etsW |a.t»SN*4 or |*fr»ONAr\| 
CJ *4 * fej 
(1^11 
C* «- *# 
Ci «- *J 

ctaf aWEK> and tiWfW then 

€ // ***** 

Cfec* ****Nrr and fctHNFNTY then 

C 4- DCfAUlTSNAN // *va*d 
Hk# i WfRO then 

C I «- ZERO 
c*se* a.MNFMTY tftcn 
CJ +- *4 * 54 
CI MFMTV 

wcrt MLSE // should hint covered at the cam 



def mst> +- AndmsbM as 

MA» 2»« // Largest postt* f value after matrix mu*p*y 
far J*- 0 to 



Oct m +- Pacfcf (precasounr* as 
case a.t of 
NORM 

*nst> +• ftndntsbfa I) 

m rra6»i-lbici(prec| // Hb for norm* 
rdn *«t)Utttpnx)^eO*fbtfs(prec| // hb a dcnormal 
rb frn > rdn| ? m - rdn 
rt> 1 0 tfwn 

<Wl.O 1 1 o-*> 
eadj «- 0 

cast round of 

C 

s O fnjt >^ t > 1 1 l-a.sf 0 

s «- 1 1 puf* 5 

H NONE: 

S o"**^ || - ^ || aJft" 1 

* 

# ai^Ko • 0 men 

rats* ttoaongPo*u>Vitnmet>c // inexact 



s «- 0 

2: 
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f - 0 

tndcts* 

e*4 0 



a# ♦ msb - I ♦ e*dj ♦ * oustprtd 

* fxxw* m NONE then 

* w II O^^NP^O 1 1 

the 



thai *en 2 I****** then 
Ground. NONC thtn 

round^oeartu overflow handfcng 

ffcf 

^*M»ngfWn^nihm«lc //underflow 



SHAN: 

Ground 0 NONC then 

* - a* < ftM4prtc| then 



tndc*s« 

defdef 



ONANt 

* -Ae < fotifprtc) then 

« aj 1 1 i««««ipr««t , , ai^ |<j0 1 1 oft**"**** 

e#ie 

116 ji -A*Hb«ifer»cM .0 • 0 

tncW * - " 1 1 ,ttwa ' » i++tM+m+mn 1 1 115 

ZCRO 

* +- AS II 1 1 o**Xpr«c) 
(NFINfTV: 

* AS II |«t«tpr»cf 1 1 o^ttCprvci 
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def *• *- rsmfcrtprec. a rot nd| as 
case a t <y 
NORM: 

msb «- findmsbfaf) 
rt> «- -a e 
if rb S 0 then 

arfr 4- a.f miO 0 M 0^> 
aims «~ msb • rb 

else 

case round of 
C CD 

S «- omsb-rt> | | i-^jjrt* 
F. F.Ot 

5 omsb-rt> | | (ai.s}rt> 

ISL NONE: 

, nm*^ ! ! -jif^ 1 1 ailfr' 

X 

rf atf^i o * 0 then 

raise FtoafcngfointArttvnetK // Inexact 

endif 

s 4- 0 
1. 7.0: 

s *- 0 

endcase 

v*- lOlla Woi • (OMsj 
rf v mS 5 ■ I then 

aims msb ♦ I « rb 

else 

aims msb • rb 

endif 

**i 4- v^nj rt) 

endif 

if aims > prec then 
case round of 

C D. F£>. NONE. ZX> 

a* «- a.s II l-asjP"*' 1 
C. F. N. X. Z 

raise Float>ngFointArithn«ctir // Overflow 

endcase 
erse# a s • 0 then 
ai atf r 

else 

ai -aifr 

enefcf 

ZERO: 

» 4- OP"* 

SNAN. ONAN: 

case round of 

C O. F 0. NONE. Z D 
a 4~ op»« 

cuxz 

raise FloatingPointVithmctic // Invalid 

endcase 
INFINITY: 
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case round of 

C D fJCk NONE. 2 X> 

m at N | *sjpnx i 
C F. H X / 

endcase 

endcase 

endrfrf 

(We*- rtecrestja) as 
as 0 
61 4- NOAM 

be 0 
6f I 

errfdef 

def c Tsqrestfa) as 
6 s «~ 0 
61 4- NORM 
6e 0 
6f l 

c «- frsnrsqrfrdMcxaM 
enooe? 

del c restjaj as 

* MUNO«M then 

rm6 fodmsbt*4 
*• *e ♦ ms6 - 13 
* f - **mj> mtf^l? 1 1 I 

crse 

c a 

end* 
enddcf 

def c fsqrvj as 

rf Jat-NO^M and PlW3| men 
c s 0 
C t NOAM 
' »aeo • l| then 

ce Me* 127)/ a 
Cl sqn>J 1 1 0^ 7 | 

ehe 

ce +- |ae.|J«/ * 
ci «- iqrfai It 

end* 

etscrf or ^ON^ or *WRO or ftaMNMNTTY) *nd fat-Of 

else* «at»NOft* or |a.»lNFiNmf| and M s- II then 
C DffAULTSNAN // Invalid 

else 

assen FAU£ // shoutt nave cowed ai me cases above 



VI. 
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PiQital Signal Processing 

The Zeus rwocesst* provides a set of operations that maintain the fullest poasihle use of 
128 hit data paths when operating on !;iwcr> precision fixed point or floatingpoint vector 
values. These oocranoos are useful foe several application areas, including digital signal 
po«c&ung, image processing *nd synthetic graphics. The basic goal of these operations is to 
accelerate the pernicmancc of algorithms that exhibit the following characteristics: 

Low-or«bton arfthmehr 

The operands and intermediate ra>uh* are fixed-point values represented in no greater than 
64 bit precision, Ftw floating point arithmetic, operands and intermediate result are of 16, 
M, or 64 bit precision. 

The rued point ahthmerjc operations include add, subtract, multiply, divide, shifts, and set 
on compare. 

The use of fixed-point anthmcoc permits vaiiou* forms of operation reordering that are not 
permitted m tkaatmg-ptMnt arithmetic. Specifically, commutaoviry and associativity, and 
disenburjon identitses can be used to reorder operations. Compilers can evaluate opcrabons 
to determine what intermediate precision is required to get the specified arithmetic result. 

Zeus supports several levels of precision, as well as operations to convert between these 
different levels. These precision levels are always powers of two, and are explicitly specified 
m the operation code. 

VThen specified, add, subtract, and shift operations may cause a fixed-point arithmetic 
exception to occur on resulting conditions such as signed or unsigned overflow. The fixed- 
point a ri thmetic exception may abo be invoked upon a signed or unsigned comparison. 

Sequential acre** to data 

The algonthms are or can be expressed as operations on sequentially ordered items in 
memory. Scatter gather memory access or sparsc-matm techniques are not required. 

Where an index varablc is used with a multiplier, such mulnpbers must be powers of two. 
When the index is of the fiirm: nx+k, the value of n must be a power of two, ami the values 
referenced should have k include chc majority of values in the range (Ln»l. A negative 
multiplier may also be used. 

VttttriaMc ppcratara 

The operations performed on these sequentially onlered items are identical and imlependent. 
Conditkmal operations are cither rewritten to use Boolean vanables or masking, or the 
ciirnpilcr n permitted to convert the code into such a fi*m. 
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Pato-handKno Orations. 

The characteristics of these algorithms include sequential access to data, which permit the 
use of the normal load and store operations to refer race the data. Ocdct and hexiet bads 
and stores reference several sequential items of data, the number depending on the operand 

precision. 

The discussion of these operations is independent of b>te ordering, though the ordering of 
bit fields within octlets and hcxlets must be consistent with the ordering used for bytes. 
Specifically, if big-endian byte ordering is used for the loads and stores, the figures below 
should assume that index values increase from left to right, and for little-endian byte 
ordering, the index values increase from right to left For this reason, the figures indicate 
different index values with different shades, rather than numbering. 

When an index of the nx+k form is used in array operands, where n is a power of 2, data 
memory sequentially loaded contains elements useful far separate operands. The "shuffle" 
instruction divides a tridet of data up into two hexkta, with alternate bit fields of the source 
tridet grouped together into the two results. An immediate fie'd, h, in the wsttucdoo 
specifies which of the two regrouped hcxkts to sekct for the result. For example, two 
X-SHUFFUU56 rd=rcjb32<128> operations rearrange the source tridet (cjb) into two 
hcxlets as follows: 




32-bft 2-way deal 



In the shuffle operation, two hexfct registers specify the source tridet, and one of the two 
result hcxlets are specified as hexiet register. 
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IV example above directly applies to the case where nisi When n it larger, shuttle 
operations can be used to further subdivide the scoucntul stream, Foe example, when n is 4. 
m-e need to dcaJ out 4 acts of doublet operands, as shown in the figure bcW 2 




When an array result of computation n accessed with an index of the form nx+k, for n a 
power of 2, the reverie of the "deal" operation needs to be perforrned on vectors of results 
to interleave them for storage in sequential order. The -shuffle- operation interleaves the bit 
fields of two octlets of results into a single hexkt For example a XS\ IJFFI JL16 operation 
eombtnes two octlets of doublet fields into a hcxlct as follows: 
















3SQ 



16-btt 2 -way shuffle 



Rjc larger values of n, s aeries of shuffle operations can be used to combine additional sets 
of fields, similarly to the mechanism used fbr the deal operations. For example, when n is 4, 
w* need to shuffle up 4 acts of doublet operands, as shown in the figure below: ' 



'ill C CD 




16-btt 4-way shuffle 



When the index of a source array operand or a destination array rcjult is nested, or in other 
words, if of the form nx+k where n is negative, the elements of the array mutt be arranged 

1X * of munr of a four w»y (McinU found m ifcc ^mla: l>*tt»J l'rocr%un« 

AppWiOoni: Co u rt m ow of Color to Monochrome 

\\n cumplr of mt in* of • four w»r thuffV can tic found in the >pprndi>: l>«t!»I S*n*J IVxetun. 
AppoonunK Conwrmon of Monochrome to Color 
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in reverse order. 11k "iwizzk" opcraofin cm reverie the order of the bit fields in t hexkt 
For example, a XSVHZZ! H njsfc. 127,1 12 operabon revet ki the doublets within a hexJct: 




In s<ime ciscs. it is desirable to use a group instruction in which one or more operands is a 
single value, not an array. The "swizzle" operabon can aJ»o copy operands to multiple 
locati<»m within a bexki. For example, a X .SVC"! XX! J- 15,0 operauon copies the kra^ordcr 
16 bits to each double within a hexlet. 

Vanaoons of the deal and shuffle operations are also useful fo» converting from one 
precision to another. This may be required if one operand is represented in a different 
precision than another operand or the result, or if computation must be performed with 
intermediate precision greater than that n( the operands, such as when using an integer 

multiply. 

VCnen converting from a higher prccixi'm to a lower precision, specifically when halving the 
precision of a hexlet of hit fields, half of the data must be discarded, and the bit fields 
packed together. The "oimprcss" operation is a \ariant of the -deaf' operabon, in which the 
operand is a hexlet, and the result is an KtJct. An arbitrary half-sized sub-field of each bit 
field can be selected to appear in the result. For example, a selection of bits 19..4 of each 
quadlct in a hexlet is performed by the XXOMPRESS rd=rc.16.4 operauon: 

E 



T 








SSSSSI 







Compress 32 bits to 16, with 4 -bit right shift 



VCncn converting from lower precision to higher- precision, specifically when doubling the 
precision of an octlet of bit fields, one of several techniques can be used, cither multiply, 
expand, or shuffle. Each has certain useful pn>pcrtics. In the discussion below, m is the 
precision of the source operand. 

1 ne multiply operation, desenbed in detail hclow, automab. illy doubles the precision of the 
result, so multiplication by a constant vector will simultaneously double the precision of the 
operand and multiply by a constant that can be represented in m bits. 
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An operand can be doubled in precision and shifted left with me 'expand" operation, wh;ch 
is esacnoaD) the reverse of the "ctimprcss" operate n For example the MfXPAND 
rd=rr,l6.4 expands from 16 bits to 32. and shifts 4 bits kft 




The "shuffle" operation can double the p.ccision of an operand and multiply it by I 
(unsigned only), 2" or 2*41. by specifying the sources of the shuttle operant to be i 
mocd regMcr and the source operand, the source operand and xeru, or Unh to be the 
source operand. When multiplying by 2m, a constant can be freely added so the source 
operand by spcofpng me constant as the right operand to the shuPV. 

Arithmetic Ooeratimy? 

The characteristics of the algorithms that affect the arithmetic operations most directly are 
kry^reoston arithmetic, and vcetnrixablc operations. The feed-point arithmetic operations 
provided are moat of the functions provided in the standard integer unit, except for those 
that check conditions. These functions include add, subtract, bitwise Boolean operations* 
shift, set on condition, and multiply, in forms that take packed sets of bit fields of a specified 
so* as operands. The floating-point arithmetic operations provided are as complete as the 
scalar floatingpoint arithmetic set. The result is generally s packed set of bit fields of the 
same sue as the operands, except that the fixed-point multiply function intrinsically doubles 
the precision of the bit field. 

Conditional operations arc provided only in the sense that the set on condition operations 
can be used ro construct bit masks that can select between alternate vector expressions, 
using the bitwise Boolean operations. AD instructions operate over the entire oedet or hexJrt 
operands, and produce a hexlct result. The sizes of the bit fields supported are always 
powers of two. 

Galois Field Operations 

Zeus provides a iceneral software sok**vm to the mmf common opera boot required for 
Gak*s Field arithmetic. The mstnsctioo? t nsvided include a polynomial multiply, with the 
pol/rvjrntal specified as one register operand This instnicrior. can be used to perform CRC 
generation and checking. Reed* Solomon code generation and checking, and spread spectrum 
encoding and decoding. 
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Software Conventions 



The following section describes software cuovcntumj that are to be employed at software 
module boundaries, in order to permit the combination M sctwatcJv compdrd code and to 
provide srsnuird interfaces between apphcatif >n, horary ami tyttcm software. Renter usage 

and procedure call conventions may he mudined. umphBed or optimized when a amgje 
o>rnpiJanon encloses procedures within a compilabiin unit »j that the proccdurea have no 
external interfacei. For ciample. inter i«l procedures may perrm? a greater number of 
renter pa*»ed parameter*, or ^avc regscer* allocated to avtud the need to save registers at 
procedutc boundaries, or may use a single stack or data pointer allocation to suffice for more 
than one level of procedure call. 

Register Usage 

All Zeus registers are idenocal and general purpose; there is no dedicated aero- valued 
register, and no dedicated floanng potnt registers However, some proixdure-^iO-oricnted 
instructions imply usage of registers aero (0) and one (1) in a manner consistent with die 
conventions described below. By software airrvtjtbori, the f**v- specific general i 
used in more specific wars. 



C 



register 


assembler 


usage 


how saved 


number 


names 




0 


*>, r0 


link pointer 


carter 


1 


dp r rl 


data pointer 


carter 


2-9 


r2-r9 


pararr tten 


caller 


10-31 


rl0-r3l 


temporary 


caller 


32-61 


r32-r61 


saved 


cafec 


62 


fp, r62 


frame pointer 


canee 


63 


sp. r63 


stack pointer 


caflee 



register usage 



At a procedure call boundary, registers are saved either by rise caller or callee procedure, 
which pmvsdes a mechanism for leaf procedures to avtad needing to save regis terr 
Compilers may choos* to allocate variables into caJtter or callee saved registers depending on 
how their lifetimes overlap with procedure calk. 

Procedure Callinc Conventions. 

Pn<cdurc parameters are normally a£<>catcd in registers, starting from register 2 up to 
register 9. These registers hold up to 8 parameters, which may each be cf any size from one 
byte to sixteen bytes (hcxlct), including floating point and small structure parameters. 
Additional parameters are passed in memory, allocated on the stack. For C procedures which 
use vsnugs.h or stdarg.h and pass parameters to further procedures, the compilers must 
leave room in the stack memory allocation to save registers 2 through 9 into memory 
contiguously with the additional stack memory parameters, so that procedures such as 
_dt»prnt can refer to the parameters as an array. 
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INncdurc return value* are also allocated in registers, staging tmm register 2 up in regmcr 
•> larger values arc pas«rd in rncmory. alhicattd on the stack. 

There are several pointers maintained in registers for the procedure calling conventions In 
•p. dp. fp. 

The Ip raster contains rhe address to which the callee shiHild return to at the conclusion of 
thr pnicediire. If the procedure is also a caller, the Ip register will need to be saved mi the 
stack, once, befure any procedure call, and restored, once, after all procedure calk IV 
prtjccdurc returns with a branch instruction, specifying the Ip register. 

1>k sp register is used to nwm addresses to save parameter and other register, maintain 
local vanables. i.e., data that is allocated as a UFO itack. For pn*edures that requite a stark, 
normally a single allocation is performed, which allocates space for input parameters, local 
vanables, saved register*, and output parameters all at once. The sp register is ahvays hcxlct 
aligned. 

The dp register is used to address pointers, literals and static variables for the procedure. The 
dp tcgis*r points to a small (approximately 4096-cntry) array of pointers, btctals, and 
t^zcJfy allocated variables, which is used locally to the procedure. The uses of the dp 
register arc similar to the use of the gp register on a Mips R series pneessor. except that 
cadvprocedurc may haw a different value, which expands the space addressable by small 
offsets frorr this pointer. This is in important distinction, as the ofTsct field of Zeus load 
and store instructions arc only 12 bits. The etimpUer may use additional registers and/or 
indirect pointers to address larger regions for • single procedure. The compiler may also 
share a single dp register value between procedures which art compiled as a single unit 
Concluding procedures which are externally callable), eliminating the need to save, modify and 
restore the dp register for calls between pniccdurcs which share the same dp register value. 

1-oad- and store- immediate- aligned instructions, Reifying the dp register as the base 
register, are generally used to obtain values from the dp region. These instructions shift the 
immediate value by the logarithm of the si/c of the operand, so loads and stores of large 
operands may reach farther from the dp register than of small operands. The size of the 
addressable region is maximized if the elements to be placed in the dp region are sorted 
according to sixc, with the smallest elements placed closest to the dp base. At poinu where 
the fixe changes, appropriate padding is added to keep elements aligned to memory 
boundancs matching the sixc of the elements. Using this technique, the maximum size of the 
dp region is always at least 4096 items, and may be larger when the dp area is composed of a 
mixture of data sixes. 
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The dp rt j(utcr mechanwn iho permit, code to be .hami. with each ...tic inmnce of the 
dp re** u-uned u> , diflrnn, .ddm. ,n memory. In con.uncnon with^W 

S£r f^r *■ % ^tjs 

To implement an tme, module (.ep^ch; cwnptlcd) procedure call. ,hc b rcp.ter i. k«ulcd 
wth ,h, entry p.«rw of ,hc procedure, and the dp tester k»dcd with L valoVof £ dn 
re*.*, reared for the procedure. TVicic two value, arc k,ced adjacent to eh,!?., a 
patr of ocde, qu^nwe, j„ lhe dp region fi„ ^ ^ pfOCedurc . Fwf , 
.....t rnodule pn«cedure call, d* linker fill. ,n the 3-c. .« link time. E £ 
mcch.n,Mn aho provide. for dynamtc knktn^. by initially f,Uin K in the lp .nd dpS in * 
tZETZ " *r*™" ^ ^-n-mie bnker c.n tJe the content, if he 

to fill ,n the r-nnter. .nd rewmc cxccuo.kv Specially, the lp value i. initially k, to pom. to 
the lp and dp value, in the dp region of the «L* procedure. The kJcn,£ 
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can be d»o»vcrcJ from a string Mowing the dp pointer. < r a separate taliL, indexed by the 
dp pointer. 

The fp register it used to address the stack frame when the stack sixc vane, during execution 
of a procedure, such as when using the GNU C aHoca funcoori. When the stack aixc can be 
determined at compic tunc, the jp register is used to address the stack frame and the fp 
register may be uaed for any other general purpose a* a caHee saved register. 

Typical static United. Intra-mo/lule calling sequence: 
caBrr (non-leaf). 

c ** r sp»-we // allocate caller stack frame 

save onoinaf Id reoister 

... (caiee using same dp as caier) 
ailMlCI caiee 

- (caiee using same dp as caHer) 
aUNMU caiee 

LI^A <p-sp.off // restore original ft? register 

AAOOl sp»sue // deallocate caier stack frame 

B ¥> // return 

caDce (leaf): 

caiee M (code using dp) 

8 a? // return 

Procedures that are compiled together may share a common data region, in which case there 
is no need to save, kiad. and restore the dp region in the callcc. assuming that the callcc docs 
not modify the dp raster. The porclarivc addressing of the B.UNK.I instruction permits 
the code region to be p<>nnon- independent. 

Minimum at atlc-f Inked. Jntra-modufe calling sequence! 

caller (non-leaf): 

caier: ACOPf r3l«lp // save original fp register 

- (caiee using same dp as caller) 
aUNJU caiee 

- (caiee using same cap as -aier) 
BUNKJ caiee 

* r3l //return 

caOee (leaf): 

caier ... (code usmg dpi rjl unuseol 

e *> // rtturn 

When aO the caJJce procedures are intra module, the stack frame may also be eliminated 
from the caller procedure by using -tcm|«»rary" caller tave registers not utilized by the caJlee 
leaf procedures. In acldinon to the !p value indicated a»M>vc t this usage may include other 
valuer and variables that live in the caller procedure across calJee procedure calls. 
Typical dynamic-linked, Infr-modul* filing sequence; 
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caller (non-leaf): 






caUer: AADOi 


spw*stre 


// aMocMe caNer stack frame 


UMA 


tUfxoff 


// save original lp reguter 


SI 64 A 


dp.sp.ofT 


// save c«pnaf dp register 


... (code using dpf 






U64/\ 


<p«dpoc7 


// toad p 


U.64A 


dp*dp.orT 


// load dp 


BUNK 




// invoke caflee procedure 


U64A 




// restore dp reg»ster from stack 


{coot using dpf 






U.64A 




// restore original register 


AADDI 


Sp-Mf 


// deallocate caller stack frame 


B 


Ip 


// return 


callec (l«f): 






carter ... (code using dpf 






8 


•p 


// return 



The load instruction is required in the caller following the procedure call to restore the dp 
register. A second load instruction also restores the lp register, which may be located at any 
point between the last procedure call and the branch instruction which returns from the 
procedure. 

System and Privileged Library Call s 

It is an objective to make calls to system facilities and privileged libraries as similar as 
possible to normal procedure calls as described above. Rather than invoke system calls as an 
exception, which involves significant latency- and compbeaoon, we prefer to use a modified 
procedure call in which the process privilege level is quiet 1) raised to the rcquirvd IcvcL To 
provide this mechanism safer)-, interaction **th the virtual memory system is required. 

Such a procedure must not be entered from anywhere other than its kgiomare entry point, 
to prohibit entering a procedure after the point at which security checks are performed or 
with invalid register contents, odierwise the access to a higher privilege level can lead to a 
security violation. In addition, the procedure generally must have access to memory data, for 
which addresses must be produced by the privileged code. To facilitate generating thc*e 
addresses, the branch gateway instruction allows the privileged code procedure to rely the 
fact that a single register has been verified to contain a pointer to a valid memory region. 

The branch gateway instruction ensures both that the prmredure is invoked at a proper entry 
point, and that other registers such as the datr. pointer and stack pointer can be pn>pcrfy scl 
T«, ensure this, the branch -gateway instruction retrieves a -gateway" directly from the 
protected virtual memory space. The gateway contains the virtual address of the entry point 
of the procedure and the target privilege level. A gateway can only exist in regions o( the 
virtual address space designated to contain them, and can only be used to access privilege 
levels at or below the privilege level at which the memory rcg.on can be written to ensure 
that a gateway cannot be forged. 

The branch-gateway instruction ensurrs that register 1 (dp) contains a valid pointer to the 
gateway for this target code address by comparing the contents of register 0 (lp) against the 



-44- 



MkroUniry 



'/.cms Sv\tcm ArchiWt'.!." 



gateway r?tricvcd trim memory tnd causing an exception trip if they do rot match. By 
ensuring that register I points to the gateway, auxiliary m formation, such as the data pointer 
and stack pointer can be set by loading vahics located by the contents of register 1. For 
example, the eight bytes roDowmg the gateway may be used as a pointer to a data region for 
the procedure. 

Before executing the branch-gateway instruction, register I must be set to point at the 
gateway, and register 0 must be set to the address of the target code address phis the desired 
privilege level. A "L.1.64.LA tO=sr! t (T instruction b one way to set register 0. if register 1 
has ahead)* been set, bur any means of getting the comet value into register 0 is permissible. 




rS wS x2 99 



r2 w2 xS 00 



;i:.-.-.:.:^t 



r2 w2 x3 gS 




Gateway with pointers to code and data spaces 



Similarly, a return from a system or privileged routine involves a rcducbon of privilege. This 
need not be carefully controlled by architectural facilities, so a procedure may freely branch 
to a ^privileged code address. Normally, such a pnKedure restores the stack frame, then 
uses the branch-down instruction to return. 
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ctOcr 
carter 



Typical tyriamc-ttr***. tnter-gateway calling f#c}u«nc«: 



// allocate caier stack frame 



//load* 
//load dip 



a*doi 


sp#-ifre 






Si.64«A 






t-<Jp.or7 


UM4A 


dp*dp.of 


aGATE 






djxjrxoT 


- *ode uskog dp| 




U64A 




AADCX 


8 





// 
// 

// return 



original tp 



caDee (non-leaf): 




calce U6AA 


dp-dpioir 


UMA 


sp.dp r on* 


U64A 


sp-dctort 


UMA 


tupjoff 


SXMA 


dcufJioff 


-Mongclpf 




UMA 


ofotrxofT 


Kode usng dpf 




LI 64 A 




UMA 


sp-spiorT 


aoowN 


*> 



// load dp with data pointer 
// new stack pointer 



// restore original *> register 
// restore ongtnM sp register 

cxilec (leaf, no stack): 

carter: _ fusing dp| 

&OOWN p 

It can be observed that the calling sequence is identical to that of the iptcr-modulc calling 
sequence shown above, except for the tue of the B.GATF instruction instead of a RL1NK 
instruction. Indeed if » B.GATE instruction is used when the privilege level in the lp 
register is not higher than the current privilege level the B.GATE instruction performs an 
identical function to a BJJNK. 

The callec, if it uses a suck for local variable allocation, cannot necessarily trust the value of 
the sp passed to it, as it can be forged Similar!), any pointers which the caUee provides 
should not be used directly unless it they arc verified to point to regions which the caDce 
should be permitted to address. This can be avoided by denning application pn>gnuTuning 
interfaces (APIs) in which all values are passed and returned in registers, or by using a 
misted, intermediate privilege wrapper routine to pass and return parameters. The method 
described below can also be ujed. 

It can be useful to have highly privileged code call lcss privilegerl routines. For example, a 
user may request that errors in a privileged routine be reported by invoking a user-supplied 
crmr-k>fcging routine. To invoke the procedure, the privilege can be reduced via the branch- 
down imtrucbon. The return from the procedure actual}) requires an increase in privilege, 
which must be carefully controlled This is dealt with by placing the procedure call within a 
lower-privilege procct!«irc wrapper, which uses the branch-gateway instruction to return to 
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the higher pmnlcgc region after the caD thmugh a secure .c-entry point. Special care must be 
taken to ensure that the less privileged routine is not permitted to gain unauthorucd access 
by corrupooo of the stack or saved registers, such as by saving all registers and setting up a 
new stack frame (or restoring the original kwerpeivilcgc stack) that may be manipulated by 
the lesvprivilcgcd routine. Finally, such a technique is vulneraUe to an unprivileged routine 
attempting to use the re-entry point direct))', so it may be appropriate to keep a privileged 
state variable whkh controls permission to enter at the re-entry point. 

Instaicrion Scheduling 

The next section describes detailed pipeline organization for Zeus, which has a significant 
influence on instruction scheduling. Here we will elaborate some general rules for effective 
scheduling by a compiler. Specific information on numbers of functional units, functional 
unit paralleli&m and latency is quite implementation-dependent, values indicated hex are 
valid for Zeus's first implementation. 

Separate Addressing from Execution 

Zeus has separate function units to perform addressing operations (A, U S, B instructions) 
from execution operations (G, X, E, W instructions). When possible* Zeus will execute all 
the addressing operations of an instruction stream, deferring execution of the execution 
operations until dependent load instructions are completed. Thus, the latency of the manor)' 
system is hidden, so long as addressing operations themselves do not need to wait for 
memory. 

?QftW?rg Piling 

instruction! should gene rail)- be scheduled to that previous operations can be completed at 
the time of issue. Then this is not possible, the processor inserts sufficient empty cycles to 
perform the instructions precise!)' - explicit no-operation instructions are not required. 

MultJptg 

Zeus can issue up to two addressing operations and up to two execution operations per cycle 
per thread Considering functional unit parallelism, described below, as many of four 
tratruction issues per cycle are possible per thread. 

Functional Unit raraJfclfrni 

Zeus has separate function units foe several classes of execution operations. An A unit 
performs scalar add, subtract, boolean, and shift-add operations for addressing and branch 
calculations. The remaining functional units are execution resources, which perform 
operations subsequent to memory loads and which operate on values in a parallel, 
partitioned form. A G unit performs add, subtract, boolean, and shift-add operations. An X 
unit performs general shift operations. An R unit performs multiply and filiating -point 
operations. A T unit performs table-look up i ipcraui >ns. 
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Each instruction uses one or more of these units, according to the tabic below. 



Instruction 


A 


G 


X 


E 


T 


A 


X 










B 


X 










L 


X 










S 


X 










G 




X 








X 






X 






E 






X 


X 




^.TRANSLATE 1 


X 








X 


WAWLA4AT 


X 




X 


X 




wswnch 


X 




X 







Latency 

The btency of each functional unit depends on what operation is performed in the unit, and 
where the rcauh is used. The agpeasrve nature of the pipeline makes it <£fficuh to 
characterize thr latency of each operation with a single number. Because the addressing unit 
is decoupled from the execution unit, the btency of load operations is generally hidden, 
unlca the result of a load tnsoucrion must be returned to the addressing unit Store 
instructions must be able to compute the address to which the data is to be stored in the 
addressing unit, but the data will not be irrevocably stored until the data is available and t b 
valid m retire the store tratrucoon. However, under certain conch bona, data may be 
ttKwarucd from t store instruction to subsequent load instructions, once the data b 
available. 



The latency of each of these units, for the initial Zeus implementation b indicated below: 



Unit 


instruction 


Latency nies 


A. 


A 


1 cycle * " 




L 


Address operands must be ready to issue. 
4 cycles to A unit 0 to G. X E. T units 




S 


Address operands must be ready to issue. 
Store occus when data is ready and instruction 
may be retired. 




B 


Conditional branch operands may be provided 
from the A unit (64-bit values), or the G unit (128- 
bit values). 4 cycles for mispredicted branch 




W 


Address operand must be ready to issue. 


G 


G 


1 cycle 


X 


X, WSWITCH 


1 cycle for data operands. 2 cycles for shift 
amount or control operand 


E 


E. W.MULMAT 


4 cycles | 


T 


W.TRANSLATE 


1 cycles 
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Pipeline Organization 

Zeus performs til tmmaetions « if executed one byline, in <*dcr. with precise excepdons 
always available. C>nscqucnrly, aide that ignores the subsequent discus*** of Zeus pipeline 
rniplertKntarions win soil perform comedy. However, the highest pcrf •rmance of the Zeus 
proceaaor is achieved only by matchup die ordering of instrucrions to the characteristics of 
the pipehnc. In the following discussion, the genera) charactensucs of aO Zeus 
iniplcmentations precede discussion of specific choices for specific implementations. 

Ctesycal Piping Sftvcturgs 

Pipelining in general refers to hardware structures that overlap various stages of execution of 
a series of tnsmictiorts so that the time required to perform the series of instructions is leas 
than the sum of the times required to pcrfi*m each of the instrucrions separately. 
Additionally, ^penncs cany to connotation of a collection of hardware structures which 
have a simple nroermg and where each structure perform a specialised runctjon. 

The diagram below shows the timing of what has become a canonical pipeline structure for a 
simple RISC processor, with time on the horizontal axis increasing tn the right, and 
successive instructions on the vertical axis going downward. The stages I. R. E, M. and W 
refer to units which perform instruction fetch, register file fetch, execution, data memory 
fetch, and register file write. The stages are aligned so that the resuh of the execution of an 
instruction may be used as the source of the execution of an immediately ft>Dowing 
insuuerion, as seen by the fact thst the end of an R stage (bold in hnc 1) lines up with the 
beginning of the E stage (bold in line 2) immediately bekiw. Also, it can be seen that the 
rest* of a load operation executing in stages E and M (bold in hnc 3) is not available in the 
bnirtediatery folfcmnng instruct™ (bne 4). but may be used two cycles later (line 5); this is 
the cause of the load delay slot seen on some RISC processors. 
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E 


M 


W 










1 


R 


E 


M 


W 
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1 


R 


E 


M 


W 
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1 
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E 


M 


W 1 


s 
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R 


E 


M | W | 



canonical pipeline 



In the diagrams Mow, we simplify the diagrams somewhat by eliminating the pipe stages for 
instruction fetch, register file fetch, and register file write, which can be undcrstoiHl to 
precede and frWlow the portions of the pipelines diagrammed. The diagram above is shown 
again in this new format. showing that the canonical pipeline has very little overlap of the 
actual execution of instructions. 



EE 



M 



M 



M 



canonical pipeline 



M 



M 
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uipcrscalar pipeline it one cap able of simoJrancousry issuing two or more trotruction* 
which arc independent, in that they can be executed tn cither order and separately, producing 
the tame result aa if they were executed aerially. The diagram below shows a two-way 
lupcncaJar processor, where one instruct* m may be a register-* >- register operation (using 
itagc F) and the other may be a register-to register operation (using stage A) or a memory 
load or store (using stages A and M). 



1 


E 






A 


M 








E 








A 


M 








E 




A 


M | 



] 



A superpipelined pipeline ta one capable ta issuing ample instructions freoucntty enough that 
the result of a simple instruction must be independent 'A the imrncckatcjy following one or 
more instructions. The diagram below shows a two-^Jc superpapcHned tnirJcmcmaoon: 

•rem 



I g I g I 



Ml 



I E « M 1 



superpipelined pJpcHnc 



In the diagrams below, pipeline tugea are labelled with the type of instruction that may be 
performed by that stage. The posititon of the stage further identifies the function of that 
stage, as ft* example a load operation may require several 1, stages to complete the 
instruction. 

Surerflrioq Pipeline 

Zeus architecture provides for implements oon* designed to fetch and execute several 
instructions in each clock cycle. For a paricular ordering of tnitrucbon types, one 
instruction of each type may be issued in a single clock cycle. The ordering reouired is A, L, 
E, S, B; in other words, a register-to- register address calculation, a memory load, a register- 
to- register data calculation, a memory store, and a branch. Because of the organisation of the 
pipeline, each of these instructions may be serially dependent Instructions of type R include 
the fixed-point execute-phase instructions as well as floating-pot.it and digital signal 
processing instructions. We call this form of pipeline organization "supmtring,"* because of 
the ability to issue a string of dependent instructions in a single dock cycle, as distinguished 
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rr»*« *uprrMT4i-r of superpipelined or^m/itmnv ubivh can « inly issue seis of independent 
in>tnnnt«m 

These instructions lake from one to tour Ovky tit latency to execute. and a branch 
picdicn«»n mechanism is used to keep the pipeline tilled Tie diagram below ;how* a box tor 
tK interval between issue of each instruction and the completion. Bold letter* mark the 
cutical latency paths of the instructions, that is, the penods between the required ava»lahi!iry 
of the m Hirer registers and the earliest availability of the result renters. Tb c A I. cnticaJ 
lateno path it a special case, in which the result of the A insrrucnon may be used as the base 
tcgistci ot the I. insrrucnon without pcnalry. K instructions may require additional c>tles of 
latency for certain operations, such as fixed point multiply and divide, floating point am! 
tjgual signal pmccssing operations. 

t 



j 
i 

4 

S 
* 

7 
• 

9 

10 
II 
II 
IJ 
14 
IS 



B 



±3 



B 



J3 



B 



Supefstrinq pipeline 



Suoersorina Pipeline 

/cus architecture provides an additional refinement to the organisation defined above, in 
which tiie time permitted by the pipeline to service load operations may be flexibly extended. 
Thus, tin front of the pipeline, in which A, L and H type instructions are handled, is 
decoupled from the back of the pipeline, in which F„ and S type instructions are handled. 
This decoupling occurs at the point at which the data cache and its backing memory is 
referrnred; similarly, a FIFO that is filled by the instruction fetch unit decouples instruction 
cache references frcjm the front of the pipeline shown above. Iric depth of the FIFO 
structures is implementation-dependent, i.e. not fixed by the architecture. 
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l*hc diagram below indicates why we call this pipeline organization feature "supcrspring/* an 
extension of our supcrsrhng ocganixat»on 




With the super lpnng organixation, the latency of load ins t rue tx mi can be hidden, at execute 
instructions are deferred until the results of the load are available. Nevertheless, the 
execution unit itill pn *cs*cs instrucuons in normal order, and pnivides precise exceptions. 
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Suoerthread Pipeline 

'I Tm technique is not employed in the inttul /.cus implementation, though it was present in 
an cither prototype implementation. 

A difficulty of superpipelining is that dependent operations must be separated by the latency 
tif the pipeline, and for highly pipelined machines, the latency of simple operations can be 
quite significant, Ihc Zeus M supenhread" pipeline provides for very highly pipelined 
implements™™ by alternating execution of two or more independent threads. In this 
context, a thread is the state required to maintain an independent execution; the architectural 
state required is that of the register file contents, program counter, privilege level, local TB, 
and when required, exception status. Ensuring that only one thread may handle an exception 
at one time may minimixc the latter state, exception status. In ortk to ensure that all threads 
make reasonable forward progress, several of the machine resources must bt scheduled 
fairly. 
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An example of a resource thai is cntxaJ that it he fairly shared is the data memory /cache 
subsystem. In a prototype implemcmaoon, Zeus is able to perform a load operation only on 
every second cycle, and a store operation only on every fourth cycle. Zeus schedules these 
fixed timing resources fairry by using a round-robin schedule for a number of threads that is 
rcJahveh prime to tSe resource reuse rates. For this implementation, five simultaneous 
thread* of execution ensure that resources which may be used every two or four cycles are 
fairry shared by allowing the mstructions which use those resources to be issued «mly on 
every second or fourth issue slot for that thread. 

In the diagram bclnw, the thread number which issues an instruction is indicated on each 
duel cycle, and below it, a list of which functional units may be used by that instruction. 
Irie diagram repeats every 20 cycJca, so cycle 20 is similar to cycle 0, cyele 21 is similar to 
cycle I. etc. This schedule ensures that no resource conflict occur between threads fiw these 
resource*. Thread 0 may issue an E, U S or B on cycle 0, but on its next opportunit>\ cycle 5, 
may only issue E or B, and on cycle 10 may issue E, K or B, and on cycle IS. may issue E or 
B. 
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Sopenhread pipeline 



J 



VC*hen seen from the perspective of an individual thread, the resource use diagram looks 
similar to that of the collection. Thus an individual thread may use the load unit every two 
mstructions, and the store unit every four instructions. 
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Superthrcad pipeline 
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A Zeus Superthrcad pipeline, with 5 simultaneous threads of execution, permits simple 
operations, such as register- to- register add (G.ADD). to take 5 cycles to complete, allowing 
for an extremely deeply pipelined implementation. 

Simultaneous Multithreading 

The inrial Zeus implementation performs simultaneous multithreading among 4 threads. 
Each of the 4 threads share a common memory system, a con mon T unit. Pair* of threads 
share two G units, erne X unit, and one E unit. Each thn ad individually has two A units. A 
fait allocation scheme balances access to the shared resources hy the four threads. 
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Branch/fetrh Prerfiftfap 

Zeus dies not have delayed branch instructions, and »fi re'ies upon branch or fetch 
prediction to keep the pipeline fuU around unconHioonaJ and condioonaJ branch 
tnstrucoons. In the simplest form of branch prediction, as in Zeus'* first implemcntabon, a 
taken conditional backward (roward a lower addrcs*) branch predicts that a future execution 
of the same bnnch will be taken. More elaborate prediction may cache the source and target 
addtesses of multiple bianche*. both conditional and uncondinonaJ, and both forward and 
revcrv. 

The hardware prediction meet an ism is tuned for optimizing conditional branches that ck>se 
loops or express frequent ahemanves, and will generally require substantially more cycles 
when executing condioonaJ branches whose outcome is not predominate}) taken or not- 
taken. For such cases of unpredictable conditional results, the use of code that avoids 
conditional branches in favor of the use of compare set and multiplex instructions may 
result in greater performance. 

I'ndcr some conditions, the above technique may not be applicable, for example if the 
conditional branch "guards" code which cannot be performed when the branch is taken. 
This may occur, for example, when a condioonaJ branch tests for a valid (non-zero) pointer 
and the condioonaJ code performs a load or store using the pointer. In these cases, the 
conditional branch has a small positive offset, but is unpredictable. A Zeus pipeline may 
handle liu* c-»c as if the branch is always predicted to be not taken, with the recovery of a 
misprediction causing cancellation of the instructions which have already been issued but 
not completed which would be skipped over by the taken conditional branch. This 
"conditional skip" opnmixation is performed by the initial Zeus implementation and 
requires no specific architectural feature to access or implement. 

A Zeus pipeline may also perform "branch- return" oprimixar>m, in which a branchlink 
instruction saves a branch target address that is used n> predict the target of the next 
returning branch instruction. This optimizaoon may be implemented with a depth of one 
(only one return address kept), or as a stack of finite depth, where a branch and link pushes 
onto the stack, and a branch- register pops fn.m the suck. This optimization can eliminate 
the mispredicaon cost of simple procedure calls, as the calling branch is susceptible to 
hardware prediction, and the returning branch is predictable by the branch-return 
optimization. Ijke the conditional- skip <ipu<nixahon described above, this feature is 
performed by the initial Zeus implementation and requires no specific architectural feature 
to access or implement. 

Zeus implements two related instructions that can eliminate or reduce branch delays for 
conditional loops, conditional branches, and computed branches. The "branchhint" 
instruction has no effect on architectural state, but informs the instruction fetch unit of a 
potennal future branch instruction, giving the addresses of both the branch instruction and 
of the branch target The two forms of the instruction spc ify the branch instruction address 
relative to thr current address as an immediate field, and one form (branch-hint-immediate) 
specifies the branch target address relative to the current address as an immediate field, and 
the other (branch-hint) specifics mc branch target address fiom a general register. The 
branch hint immediate instruction is generally used to give advance notice to the instruction 
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fetch unit of a branch condir* *ud imtructxHi, mi that instructions it the target of the branch 
can be fetched m advance of the brar*h-concbDonaJ instructs**, reaching the execution 
ptpehne. Placing the branch hint aa early as possible, and at a point where the extra 
instruct*** will not reduce the execution rate ormmuca rKrrforrnance. In other words, an 
opdmurirax ct*nptler thouid insert the branch -hint instruction as early as possible in the basic 
Nock where the parcel will contain at moat one other ••front-end" instruction. 

Afltiifonal Load and Exeojte Resource 

Studies of the chmmic dbrribuoon of Zeus instructions on varitnis benchmark suites 
indicate that the most rrcouctidy issued instruction classes are load instructions and execute 
instruct***. In a hij^rjerfbrrnance Zeus implementation, it is advantageous to consider 
execution ptpdmcs in which the ability id target the machine resources toward issuing load 
arvl execute instructions is increasccl 

One of the means to increase the ability to issue execute-class instructions is to provide the 
mClm ***** executc in * txuctiom m ■ *ngfe-usuc string. The execution unit actually 
teaurrcs several duonct resources, so by paro tuning these resources, the issue capability can 
be ncrtaacd without increasing the number of runcbonaJ units, other than the increased 
register file read and write porta. The parutioning favored for the initial implementation 
places aB mstnictkm that involve shifting and shuffling in one execution unit, and all 
mswe^udut involve awtepficanon, bduc&g fixed point and floating-point multiply and 
tdd to another unit Resources used for implementing add. subtract, and bitwise logical 
opetmtiom may be du p lic ated , being modest in size compared to the shift and multiply units, 
or shared between the two units, aa the operations have low-enough latency that two 
operaoiws ought be pipelined within a single issue cycle. These instructions must generally 
be tnderxndent, except perhaps that two simple add, subtract, or bitwise logical instructions 
may be performed dependendy, if the resources for executing simple instructions arc shared 
between the execution units. 

One of the means to increase the ability to issue load-class instrucbons is to pmvidc the 
means to issue two load instructions in a single-issue string. Thii would generally increase 
the resources rcoutred of the data fetch unit and the data cache, but a compensating solution 
u to steal the resources for the store instruction to execute the second load instruction. 
Thua. a sang)c-issuc string can then contain either two load instructions, or one load 
mstructiui and one store instruction, which uses the same register read ports and address 
computation resources as the basic 5 instruction siring. This capability also may be 
employed so provide support for unaligned load and store instructions, where a single issue . 
suing may contain as an alternative a single unaligned load or store instruction which uses 
the resources of the two load-class units in concert to accomplish the unaligned memory 
opera non. 

Result Forwarding 

When temp.*alry adjacent instruct*™ arc executed by separate resources, the results of the 
first instruction must KcneraDy be forwarded dirccdy to the reaource used to execute the 
second instruct**!, where the result replaces a value which may have been fetched from a 
re#ster file. Such forwarding paths usr significant resources. A Zeus implcmcntarion must 
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generally provide forwarding resources so that dependencies from earlier instructions within 
s string arc immediately forwarded to later instructions, except between s first and second 
execution instruction as described above. In addition, when forwarding results 60m the 
execution units back to the data fetch unit, additional ddsy may be incurred. 
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Instruction Set 

This section describes the instruction set in complete architectural detail Operation codes 
are numerically defined by their position in the following operation code tables, and are 
referred to symbolically in the detailed instruction definitions. Entries that span m<..e than 
one locaoon in the tabic define the operation code identifier as the smallest value of all the 
locations spanned. The value of the symbol can be calculated from the sum of the legend 
value* to the left and above the idennfier. " 

Iiutmcnop.s that haxc grrjt MmiJanry and idcmiral formats arc grouped together. Starting on 
i new page, each category ol instructions is named and introduced. 

The Operation ctnks section bsts each instruction by mnemonic that is defined on that page. 
A textual interpretation of each instruction is shown beside each mnemonic. 

The Equivalences section hsts additional instructions known to assemblers that are 
equivalent or special cases of base instructions, again with a textual interpretation of each 
instruction beside each mnemonic. Below the list, each equivalent instruction is defined, 
either in terms of a base instruction or another equivalent instruction. The symbol between 
the instruction and the definition has a particular meaning. If it is an arrow (+- or -♦), it 
connects two mathemabca))' equivalent operations, and the arrow direction indicates which 
form is preferred and produced in a reverse assembly. If the symbol is a («=), the form on 
the left is assembled into the form on the right solely for encoding purposes, and the form 
on the right is otherwise illegal in the assembler. The parameters in these definitions are 
format the names are aoWy for pattern- matching purposes, even though they may be 
suggestive of a particular meaning. 

The Redundancies section hsts instructions and operand values that may also be performed 
by other instrucrions in the instruction set. The symbol connecting the two forms is a (») 
which indicates that the two forms are mathematical^ equivalent, both arc legal, but the 
assembler docs not transform <*nc into the other. 

Ihe Selection section hsts instructions and equivalences together in a tabular form that 
highlights the structure of the instruction mnemonics. 

The Format section hsts (1) the assembler format, (2) the C intrinsics format, (3) the bit-level 
instruction format, and (4) a definition of bit-level instruction format fields that a.- not a 
one-fiir-one match with named fields in the assembler format. 

The Definition section gives a precise definition of each basic instruction. 

The Exceptions secocm hsts exceptions that may I* caused by the execution of the 
instructions in this category. 
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Major Operation Codes 

AU instructions arc 32 bio in size, and i 

code. 



31 



24 23 



I major 1 



the high order 8 bits tu specify a major operation 
0 



other 



24 



] 



The mtjof field is filled with a value specified by the fouuwtng table: 5 




major operation code tick! values 
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Minor Operation Codes 

IVw the major operation field values A.MINOR, B.MlNOR, (.MINOR, S.M1NOR. G.8. 
G.I6. G.U G.64. G.I28, XSIIIFT!. XSIUFT. E.8. E.1A, R.32. R.64. F.128. U. MINOR 1. 
and W.MINOR.B, the lowest-order six bits in the instruction specify s muvK operation 
ctKte 

3] 21*3 ; 45 0 

I ™»Jor I other | minor | 

8 18 6 



11k minor field is filled with a value from one of the following tables: 
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minor operation code field values for AMINOR 
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minor operation code field values for B.M/NOR 
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minor operation code fkHd values for LMIIMOR 
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minor operation code field values for S.MJNOR 
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minor operation code field values for Gsfre 


XV*Tl 


0 


• 




24 




40 


4a 


ia 


















FCOUNKUi 








gag 




awn 








minor operation code field values for XSHJF 
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minor operation code field values for XSHIFT 



1 1 wr 


o | a 


la 


24 


J2 


40 


4a 


sa 


\ o 




iA0C*N 










KON 


















KONU 






i/bbH 


rtuaw 


IMAM 


















IMULC 
















f*ua*x 


(UaUaJ 




fOfefX 


ttw 














IUaUjAJ 


I0r\4 


I0MJ j 












IMUtuaM 


ImulsumJ 








M6il 






fjAAioae 


luuLSua/ 


ru»*«*Y 



minor opc.abon code field values for E.sJze 



60- 



MkroUnity 



Zeus System Architecture Tuc. Aug 17, |W9 Instruction Set 

NWmm Ofmitoo Codrt 



0 

cmrr 



TOHKTTJ 



mm 

MUJUUUU 



eg I i' 



wcna w apl 



wonwrn 



wnsomr 



uaunmr 



3C 



3! 



2E 



minor operation code field values for U/.MINOR.L or wXunOR.B 



For the major operation ficM vJuet F..MULX.I, F..MDLX.1.U, R.MULX.IM 
R.MUIJC1.C, R.MUI-ADD.X.1, E.MULADD.X.I.U. F.MUI.AD1XX.I M* 
F-MULADD.X.I.C, E.CON.X.1U KCON.X.1.B, E.CON.X.I.U.1, F..CON.X.I.U B* 
E.CONXI.M4, R.CON.X.I.M.B, F.CON.X.I.C.I, FXON.X.I.C.B, P..P.XTRACT 1* 
E.FJCTRACT.I.U, WAIUI.MAT.X.I.UU W.MULMAT.X.I.U.B. W.MHI.MAT.X.I.M |* 
W.MUI.MATX1-M.B. WJWLMAT X 1.C.1, and W.MULMAT.X.I.CB, another aix bit. 
m the instruction specify » minor operation CoJc, which indicates operand size, rounding 
and shift amount 
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The minor field » filled with • value fron. ihc following table: Note that the shift amount 
field value shown below it the M ah M value, which U encoded in an instruction -dependent 
manner from the immediate field in the as aembler format. 
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minor operation code Add values for EMULXI. EMULXJU, EMUUK1M 
EMULMC EMULMDOXI, EMULADDXJU, EMULADDXIM. EMUL/SDOXJC 
ECONXIL ECONWa ECONXIUL. ECONXIUB, ECOMXIMU ECONXIMB 
ECONXICL. ECONXICB, EEXTRACTI, EEXTRACTIU. WMULMATXIUL ' 
WMULMATXIUB, WMULMATXIMU WMULMATXIMft WMULMATXJCL and 

WMULMATXJCB. 



For the major <f. nation field values GCOPYI, two bin in the instruction specify an «»pcrand 
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Fur the major opcmion field values G.ANDJ. G.NAND.l, G.NOR.I, G.ORJ. C XOA I 
G.ADD.I, G.ADD.LO. G.ADD.I.UO, GJRT.AND.FJ. GSRT..AND.NFJ. GJRT.FJ 
GJRT.GRJ, GJRT.LI. GJET.NE.I, GJRT.GFJ.U, GJRT.IJ.ll, GJUB.J, GJUB.1 A 
G.SUB.l.t K), twii bits in the instruction specify an operand sue: 

31 24 23 18 17 
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Imm 
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'Ihc ix fickl u tilled with a value from the following table: 
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G.X0RJ. GADOX GADO.I.O. GADO.I.UO, G3CT.AND.U GSctJKoTnSl 
G.SET.EJ. G.SET.GE.I. G3ETJJ. GJETNEl G3CT.GEJJJ. G-SET4J.U. &SU&1 

G3U8J.O. OSUBJ.UO 

P<»r the man* operation Add valuta E.8, R.I*. RJ2, E.64, E.128, with minor operation Add 
v«lue P..UNARY, another as bits in the instruction spedfy • unity operation code: 
g £1 « jB 17 |2_m 6S 0 
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rd 



I rc | unary | minor "I 



The unary Add ia filled with a value from the following table 




unary operation code field values for E.UNARYjIzc 



Pi* fhe m»}<* ..pennon Add value* A.M1NOR and GJvllNOR, with minor operation field 
value. A.COM and G.COM, another six bin in the instruction specify a comparison 
operation Cialr 

21" 18J7 I2H 65 0 

I major | rd 



I rc | compara \ minor | 
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The omvptff field it fiDcd with a value from the ft Jkiwing table 
•5£ 
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compare operation code ...Id values for ACOMop and G.COMop.Jizc 

General Forrm 

the Kcnenl fttftna of the instructions coded by • mi}<* up™"*"" ««k are one of the 
fuflovtnjc 
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The general fimm of the imuuebona coded by major and minor operation codes are ihk of 
the (tJkrwWft 
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1 m **<>' 
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?4 23 18 17 l? if 



j 2 1 1 6 5 0 

I rb j minor 1 
6 6 



6 5 



rd 



I rc j >lmm | minor 1 



The general form of the instrucoc>na coded by major, minor, and unary <iperation codci it 
the following: 

*! 1817 J7JJ 65 0 

I m«]or 1 rd 1 rc j unary T minor 1 

i s 1 r^-* — s — 1 
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Reciter rd is cither a source register <« destination renter, or both. Register* rc tnd rb i 
$iwty% source register 1 Register ra is always s destination register 

Instruction Fetch 

DcflQtttQQ 

dW Threadftf* *% 
tor*v*» do 

catch excepoon 

rf |f ventflegdter & f v*ntMas*|thfl # 0 then 
ExcepOoMtAte-0 then 
raise Cvendnumx* 

ends! 



mst «- Lo»dMernory>3r¥o9 Wti Cou^^ 

Instructio/yinst) 
endcatch 
case exception of 

EvendnterrypC 

ReservedtrkssjucsjoA, 

AceesjQtiJiOMwgyWtujU^dreu 

AcceiiDiiaiowrdOyTag. 

^e»r»i«io^cfByOlohafTE 

*ceiuOuattowro6ytocafTa 

^ceetsDetasWt qusredByTag. 

AcceiiDetaiiffcqusredDyGtobalTa 

/<ceuOris^equreoaya>carra 

MwlnGJoba/Ta 

MtsslnLocafTflL 

Ftee<JPo*nt^fCvnetic 

Ftaaongftrtnt* tftmedc 

GatewayOtoaaowed: 

cast ExcepoonState of 
ft _ 

Perfofrri£xceptionfexcepfJon| 

I: 

Fefformtoeptton(SfCondCxceptK)0| 
PertomMacne^r^i^rhedCxceprior^ 

endcase 
Takendranch: 

CononuaoonState 4- (£xcepoonState-0) ? 0 : CononoaOonState 
TakendranchContinur 
/• nothing •/ 



Proof aroCountef Pvoo/amCounte? ♦ 4 

CononuaftonState «- (£xcepnonState«0| ? 0 : CononuabonState 

endcase 
endforevcr 
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liMtruchon Decode 

Perform Exception 

QcfiOttQO 

def PtHonntMkvyuiM^Mt^oni as 

v «- fexcepoon > 7) ? / : exception 

t «- loadMemoryftxcepttc ^8axc£Jce^bonlUje*Thre#<r 1 20*64*8*v.64,U 
ExcepbonState • 0 then 

u «- ««9»e*d(3.l28| l» Aeo*ead|Z128| II ffeg*ead(t.l28| 1 1 RegfteadfO. 1 28) 

«orcA«e*TK)fy(Cjtcep©ona ik» EKepoonaase* Thread* 1 2a$ 1 2.LM 

**e^Whte(0.64.ProgtamCi untef 6 ^ 2 1 1 Pnvilegeleve* 

RtgWntrf 1 .64.Excepoon8 Thread* 1 28) 

RtcjWntef2.64.except»or4 

Rec/Jt^3.64/a*ngAddi ess) 

endif 

PrMlegetevrt 4- t|.o 
ProgramCourtter t^j.j 1 1 0* 
case exception of 

^cessOetaiMeciulreclSyTag. 

/ccessOetaWequiredByGloba^ 

AccecOe t aiWeq u iredBytocafTa: 

ConttnuattonState ConbnuabonState ♦ 1 
others. 

r nothing •/ 

ervtcase 

txce.ooonSwe 4- ExceptionStatc ♦ I 
enooer 



Instruction Decode 

def tftttrucbor^inst) as 
major 4nst 3 , -24 

rd inff2x.ia 
rc 4~ *st|7 M t2 
simm rt> insti i 
mmor ra «- tnst$.. 0 
case rrvgor of 
ARBS: 

ArwaysReserved 
AMNOfe 

minor «- insf 5 o 

case minor of 

AADQ. AADO.a AADO.OU. AAND. AANON, ANAND. A. NOR, 
A OR. AORNL AXNOR. AXOfc 

Address(minor.rdLrcrbi 
ACOM: 

compare instn 6 
case compare of 

ACOM.E. ACOM.NE. ACOM/NOi. A.COM AND.NE. 
ACOM.L ACOM.GE. ACOM.LU. ACOM.GEU: 

AddressCc*npare(ec*npare.rd.rc) 
others: 

raise Reservedlnstrucnon 
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Instruction Set 
tWracftoa Dtcodt 



endcase 
a sua asuba Asuauo. 

AiElANDJL AiCTANONt ASTT-t ASfT.NE. 
ASfTi, AS£T GE. ASEU.U. ASET.GC.U 

Addi cssScvcf ied|n A lor sdrcrtol 
A£HLXAD0LASHLLADO»3: 

AcWr e«Sh^e«mme<MteAckJ<in« | o/d.rt.rtof 
ASHtl.SUa ASHU.SUB* 3 

Ad()rruSh^efttfTvnediateSubcrari)iri« j 0ta rcJ.rc.rtat 
ASHU. /ISHU.Q. ASHUU.O. A5HR.I ASHRJ.U AAOTRl 

^JdreiiShi Wf nmedi>>e(riw w 
other*. 

raise Aeservedlfuouctton 

endcase 
ACOFYJ 

AddressCopy«mmediate(rTW)or/(linni7 q| 
AADOX AADO ta AAOOIU Q, AANOJ. AO*l ANANOJ. ANOU AXOUt 

ASTTANO^.l ASETANONEl ASCT1 J. ASTTNCJ. 
ASTTX.L E^TT.GCi ASmtU ASET.GC.IU 

Asuai Asuai.o. Asuaxua 

Addre«immc<Ji«te«evene<^ d 
AMUX 

A^dfetsTemarytrrwior/clrc/tu^ 
case minor of 

a 

8ranch|rdLfC,rhf 
BRACK: 

Brant hBackJrOfcrtj 
aa^ac* 

aranchfiarrieffrd^crbl 
a DOWN: 

BrancrOowntrd.rc.rt* 
B.GATE; 

BranchGatewayfrd.rt.rt* 
aHALT: 

BranchHaKfrctrcrq 
aWNT: 

BrarchHinttrcUnst i t j.»mm| 
aUNK: 

BrafKhUr*|rtlrc.rtt 
others: 

raise fceservedlrotrucfcon 

endcase 

B£. BNE. BL BGt BLU 8GE.U. BAND.E. BAN&NE: 

Branc^orvMtonalfma^ d 
BHlNTfc 

BranthHirtfrrwried^^ i2>«m ol 

Bt: 

Branchlrr)rrtf?diatefinsi2J ol 
BUNKJ: 

BrarKhlmmediateUnk|inst23 ol 
BEFI6. BUSFI6, BLFI6, BGEFI6. 
B6F32. 8LGF32. 6LF32. BGEF32. 
BEF64. &.GF64. BIF64. BGEF64. 
66FI28. BLGF12& BLFI28. BGEF128: 
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ar^ncr^oncMc^toMngl'ic^mMor.rd.rt.initii a| 

mn. imn. env* 32. bvf si 

c«e miner of 

U4L LUI6L U1L IU3JL. 164L IU64L UML I*. LI* 

U6E LUIAa L3JMU32a 164E lU64a 

Lo»d(minof.rd.fc.fh| 
ochm: 

ratse RnervedlnMruciion 

ervJcMC 

UI61 IUI6C U3JL IIU32L U64L IIU641. UIJ8L 118 W/a 
UI6AL IIUI&AL U32AX. UC/32AL U64AL UU*4*L UI78AU 

tma ijui6a U3*a uwa i*4a lawir, uiiea 
uiam uuiMa u3Ma uu32*a uwa taut* una/va 

U«dtmrne<IUMr|rMjor.rd.rc.initi| <J 

c«w rnmor of 

SI6C S32C S64L. SI20L Sa 
SI4A1. S32AL S44AL SIMM. 
SAS64AI. SCS44AJ, SMS64AL SM64AL, 

$i6a sua S64a sina 

SI&Aa S32A& S64M SIJ8AB, 
SAS64/A SCS64M. SMS64M SM64Aft 

StofffminoTardLfCfbl 
S0CSA4M SOCS64AL 

$»r«OoublKomp*rt$vwap(minw.rd.r«.re( 



wfcf JtatrvetfJostructton 

tndcase 
SI16U SIJ2L SJ64L SII28C SI* 
SIIAAC $16**1. yij&M 

$A«64AL SCSI64AL >MSJ64AL SMUH64AL 

snta »wa «64a sita* 

SII6M SI32AE SJ64Aa SJI2AAB 
SASJ64*a SCSJ64AB, SMSJ6M& SMUXJ64/sa 

StorrlrTvr»dUte(rT«jof /d/c.tnii , , oj 
Of. 0.16, 0.32. G.64. O.I 28: 

minor m$t 5 0 

case minor of 

O^XX 0-AOO U 0/00 LU 0/DOO. 0/D0CX> 

Group(m<nof4ttf 4 fd.rc.rb| 
GADOHC QJVOHf. QADDHN, QADDHZ 
OADOHUC. QADOHUf, QAODHUN. GADOHUl 

GroupAddhOVe(m*nor,»ntt , u ujf.r4rc.rfai 
GAW G-ASA 

Groi^nptacf(minor.iijf,r4rc.r^ 
CSCTAND* O.S£7ANDNC. 0 $f Tf. 0.$€7.NI. 
G5nt O.SH.OC. O.SCT.LU. G.SfT G£ U 

oitra o^uo t o.suam o.sub o. osubuo 

Ckoyp«fVfrifd|m<nor.Wf.rji.fb,rc| 
G5€T.£/. GSCUOF, O.KT.GE.F. OHUF 

cscri/x ascuofx ascT.oc.rx GUUIX 

GroupacvewctftaAfingPa^ 
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Imtmcfto* t>*corW 



minor. round, rd. re. rd| 
0 SHL IADO .G.SML 1*003. 

Gfcwp$rurUert>mrne<*att^^ 0.sue.rdLrc.rty 
GSHUSU&GSHUSUBO. 

Grt>upShMeftlmrne<*ate$u^ I . o.stf«.rd.«.rty 
GSU0HC. G SU8HF. G SU8MN. G SlJBKZ. 
G SL3HUC. G SU8HUF. OSUBHUN. G.!08HU/ 

GroupSuWrar**-WvrfmKW.KKt| <>ji/e.rd.rc.rt)f 
GCOM. 

compare «- rot 11. .4 
case compare of 

G COMi. GCOMJ4C. GCOMJWD £ G COM ANONL 
G.COMi. G COM.Gt aCOMLU. O COM Gf U 

GroupComparefcompare.iue.ra.rbf 
ocners. 

r^toe AeservecHnstruction 

endcjse 
otneri 

r **$e Reserve^dlrutrucfeon 

endcase 
G BOOltAN G eXXXLAN* 1 : 

G COPVJ GCOV>1: 

stre ♦-Oil I 11 o*»«i/ 14 

GfOcpCopyH m mex^acefrr^of Jtfe.rdrtfl 1 5 ol 
G-ANO.I. GNANO.I. G NOW, G.Oft.1 CLXCXM. 
G,\DCH GAOOIO. G^OOJUO. 

li/e «- 0 I I I II 0 4 ** 1 *!! 10 

Grouplmmedutet na/or jifesdrtinst* q| 
G S£T«ANOU G.S€T ANO.NEX GSEltl G^7.G€ t O-SCTU 

G.scTNC.i G.srr.G€J.u. gsct.uu. gsubi Gsuaxa asuaiua 

we ♦-Oil 1 11 o*****it 10 

Groupfrrtfr«dia4efteverse<l(rn*^ oj 
GMUX 

GroupTerriaryfmajor.rd.rc.rtua) 
X SHIFT: 

minor «- «$t 5 2 1 1 0* 

u/e ♦-Oil I ll 0^*14 ■ l ««i..oJ 
case minor 0/ 

X EXPAND. XUCXPANO. XSFL, XSMl.O. XSHLUO. 
XROTR. XSHR. XSHft.U. 

Croxsbar|mmor. we.rd *-\rfcf 
XSHLM XSHRM: 

Crou^lnpi^efrninor.we.rd.rc.rt)| 
ocher* 

raise Reservedlnstrucbon 

endcase 
XtXTKACT: 

Crossbar£xtract|major.rd.rc.rb.ra) 

x oePosiT. x oeposn.u withdraw xwithdran* u 

Croisbarfie4d(ma/or.rd.rc.instu ^msts ol 
XOfPOSfTM: 

Crossbarfie4dlnpiace(rfvyor.rd.rc.inst|| ^mstj ol 
X SHIFT I; 

minor «- mrtj q 
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A u> S%mi„i An Wiw Uh. Au* P. »W lt>Mruo l( .„ Svi 

Intiruihim |)r«,«lt 

case mmor s i M O' o# 

XCOMHKSU XEXPANOL XJOTM XSMLl XSML.IO. XSHLIUO 

xshri » ccmcsiiu xexpandiu. xswrw 

Cro\UwrV>orr»fTvnr<>wCrinwnof jd.rc.s>mrrt 
X SHI Ml XSHRMf 

OossoarSfK>rtlrtvnediafc<#i^ 



raise Reservedlnstruction 

endcase 
XSHJFFLt XUAMM** 

C rossbarSnufflrtm*jOf . rd. rc . rb. wmm) 

( rossDarSwuilef/Tuijor.rd.rt. mst j , ^msu nl 

kseu na 

Cf MSbarTrmaryjmajor.rd.rt .rtxraJ 
Ca E Id. C M. £ 64. U/8: 
minor mst* 0 
we#- 0 I i I M f>**ma o» f • 
case mmor of 

C CON. ECONU E.lONM. £ CON.C 

E MUL. E MU.U E.MULM f.MUl.C. ' 

EMULSUM. IMJLMJNLU. E.MULSUMM. E.MUL.SUMC. 

COV. £ OrVU £ MULfc 

£ruembfetmmor.stfe.ra.rtxrt| 

CCON.fi. £ COMf a ECONXFL. E.CONCF a 
£^semt)teConvorVfflojfjnQ/Hjii^ 

tAOOIH £ MULCF NL E MUL F N. fOIVF.NL 

IAOOJ7 £.MUL.CFZ f MUl«7. £0tV£7. 

£<ADOFF. EMULC.tr. EMULFF. EDrVFF 

EADOFC. EMULCtC. E MUL EC. £ OrVFC. 

E/0OF f MULCF. E MUL F. £.OtVf, 

£>DOFX EMULCFX fMUfX E.OrVFX 

, ^ J™?™^****^^^ major sue. mmor round, rd. re. rbt 

£ MIA ADO, CMUlADaU £.MUL>O0M £MUL>VDO C: ^ 

^-ruemo*emplace<mmor.tt/e.fd.rc,rb| 
£ MUL.SUa £ MULSUaU E.MULSU8M £ MUL SUB C 

£rmmoremplaceffeve^sed|mmor t stfe.rdrc/b| 
f MA SUB* EMLA_SUaCF: 

Eny.mbiempiaceRevenedFtoatingP 
e.sube x t suatz ESUaF.F. E.SUBFC. E.SURF. £ suaFx 

£memiie<Jevenedf*oatir^f^rrf|mmor.op, ma/or.we. 
mmor.roundL rd. rc. rhj 

E.UNARY 

case unary of 

£ SUM. EJUMU. £ LOG MOST. £. LOGMOST.U: 

Enserno<eUnary|ufvvy.rd.ft) 
EABSF. EABSFX f COPY.F. COPTS X 
£ OEFIATE.F. £ fXFLME F.N. E.DEFLATE.F7. 
E. DEFLATE .F.F. E.DEFLATE.F.C. E. DEFLATE. FX 
E FlOAT.F. £. FLOAT. F.N. E .FLOAT.F 7. 
EFLOAT F F. E FLOAT.F C. E .FLOAT.F X 
F INFLATE F. E. INF LATE. FX LNiQ.f. ENZGfX 
ERECESTF. E RECEST f X £ RSOREST F. ERSORESTFX 
E SORF. E SOft F N. ESORFZ. ESORFF. E SORF C. E SORFX 
£ SUM F. E SUM. F N. E SUM F Z. 
£ SUM F.F, E SUM F C. f SUM.FX 

E SlNXF. £ SiNXF/O. E SINX F F O. E.SiNX*. CD. t UNKfXD 
I SINX F.N. £ SJNX.F ./. E SINX F F. E SINK.F.C. IVNK FX 
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Crt*moieUr^ioati^ meruit. 
unary round, rd rcl 



f«* fteservedtnstructijn 

endunc 

OChTfS 

'a#se fteservedinstrucbon 

eodtase 

£CONXiU ECONXia ECONXHJL ECONXlUa 
€ CONXIVL E CONXIMa f CON XKX f CONXrCB 
are 4- I 1 1 0>«H 4 

Ensern«eCoiivr**£x&jr^^ 2.werdLrcmn«i rJ 

CMULX E.EXTRACT. EXMA^ 

EmcfnbMxtrac^mMor^u. uraraj 
£ tXrtUCH C IXTR^CTR/f MUUa I.MUU0U EMAX1M. EJUUUOC: 

Ui e 4- I M 0><n*i 4 

£ nsemt** *trac*rime<*ate^or.mjt j 7 .tut r&r<Mi.tns: \ q) 
£ AAX^OQXl £ AJUt>DOXtU IAIUL>DOXIM f JUULAOOXIC: 
Hit I I I 0>**H4 

Ensemtrfet *trac Un* Aed*e*npi*efm^^ , ^ 

we 4- I I I 0 

ctt^S^' 6 EMUL ^»*** CMUL*DOF64. £J*>l>OaFI28 
EMUUUaFI6. EMAJUftf U IMXSUaFM. CMOUUEf I2t. 
ESCALW»FI6. £ $C^C^m£SC^>DO>6r^ 
£nsefT^Trmjryflo*ongPo^^ 

W MINOK.a W MOSKJAX; 
caie rnmor of 

W TRANSLATE* W.TRWSLATf .16, W.TRANSIATC J2. W.TRANSLATE.64 

sue I 1 1 o 1 * 4 ^ ^ 

WWHTrarw4jetffT«jOfji^rxtrcrtt 
WMULA4ATJL WMULMAT.I6, WJUULM\T.J2 WJUULMAT.A4 
W.MULMAT.UJL WJ^U^W 

W.MULMATMa WMiLMATMlt WJY^T* 32 W^^ii 

WMULMATAa W.MU^T.P.I4 W.MULMAT PJ2. W.MULMAT.P.64 
* ? 4- I || (p^njtv* 

WMUl.UAT/lfc W.MULAIATF32. W.MULMAT.F64. 
W.MULMAT.CJ 16, W-MULMAT.CF 32. WAWUMAT.C.F64: 
we 4- I 1 1 O 3 ***** 4 
UWIoatinglW^ 
ocnen: 
endcase 

WMUL MATXa WAUCMATXl: 
Wk*e€*traafrTwyor,r*rfcrc.rdf 

^^^v!f. ^ MWWJU *• W MULMATXI U.a W MULM*TXI.UL 
W.MULMArXIM.a WMA.MATXI.M L W.MULMATXI.Ca W.MUl.MATXI.Cl 
life ♦-III 0 3 * 4n «% 4 

WideCxtrjctlmrnediaeef^^ fa.fb.rc.mst | q| 

NV MUl.MAT.G-a WAH1MAT.G.L 

Wid^MurtipfyOalors^naior.rd rt,rfcral 
W SWlTCHa W.SWTTCM.I: 

W»deSwitrh(rnaior.rd.rc.rtxra| 
others. 
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lnttrucrxm Drcmir 

rase Reservcdlnstrucbon 

endcise 
enddef 
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Afwavs Reserved 

This operation generates a reserved instruction exception 

Qpcratan f ortff . 

i Always reserved I 

Format 
ARES imm 
aresfimrn| 

3J 24 23 Q 

I ARES | Imm | 
i 24 

The reserved instruction exception is raised Software may depend upon this major 
operation code raising the reserved instruction exception in aO implementation*. The choice 
of opmbon aide intentional})' ensures that a branch to a zeroed memory area will raise an 
exception. 

def Aftffjyifteservcd as 

r*se taervedJnstrucrjon 
enddef 

Krwnrvd Instruction 
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Addtttt 

Address 

11kic IT" 1 " 1101 Pct*hwi calculations uith rwo general register values, placing rhc result in a 
KcncraJ remitter. 



Operation rate 



AADO " 


Address add 


AADO.O 


Address add signed check overflow 


AADOU.O 


Address add unsigned check overflow 


AAND 


Address and 


\ANDN 


Address and not 


ANAND 


Address not and 


ANOR 


Address not or 


AOR 


Address or 


AGRN 


Address or not 


AXNOR 


Address exclusive nor 


AXOR 


Address xor 


Redundanripf 




<=> A COPY rd=rc 


AANO r<Wcrc 


<s> ACOPV rcUrc 


ANAND rd-rcsc 


<=> A/VOr rd»rc | 


ANORrcWerc 


<=> ANOT rch*rc ~\ 


AXNOR rSTcrc 


» ASFTrd 


AXOR rcWcrc 


<=> AZERO rd 


AADD rd»rc/c 


» ASHLJ rd»rc.l 


AADO.O rdUrcrc " 


» ASHU.O rd-rc.1 


AADD.U.O rd«fcrc 


« ASHLI.U.O regret 



Selection 



dass 


operation 


check 




ADO 


NONE O U.O 


bitwise 


OR AND XOR ANDN 
NOR NAND XNOR ORN 
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Format 

op rd»rc.rt> 
rd«op(rcrt| 

gii ™» ia I? t^M 6S o 

1 AMINO* I rd | rc 1 rb 1 op \ 

B 6 6 6 6 ' 

PCXriPtiQO 

The content* of renters rc and rb arc fetched and the specified oncntion is pcr**rned on 
these operands The result is placed into register rd 

GsfiomoQ 

def AddrtssJofxroVcrq as 
c Regfteadfrc. 64| 
b «- ftegteadfrtk. 64J 
case op or 

a c ♦ b 

1 fcdj 1 1 cj ♦ (on 1 1 o) 
* <64 * t*j then 

'*$e FuedPoM^rtmetic 



* - <63 0 
AAOOUO 

t «- fO 1 II cj ♦ (0' II b| 
^ (64 * 0 then 





FuedPc*r*vV*rvrwtK 


end* 


a 


<63 0 


AANO 




a 


c and b 


AO* 




a 


c or 6 


AXOfc 




a «- 


c Jtor b 


AANDN 




a «- 


c and not b 


ANAN&. 




a 


not (c and bj 


ANOR: 


a 


not (c or o) 


AXNO* 


a «- 


not (c xor b| 






a *~ 


c or not b 



endcase 
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Instruct*** Set 



JtegUttrfrd. M. 4 
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Address Compare 

TScic operations perform calcubbunt with rwu grrcrai repiter vtluct tixi ^dciix • fiscd- 
potm arithmetic cxccjmmxi if the condition specified n met 



Operation codct 



ACOMANDE 


Address compare and equal 2ero 




ACOMANO.NE 


Address compare and not equal zero 




ACOME 


Address compare equal 


ACOM.GE 


Address compare qr eater equal signed 


ACOM.GE.U 


Address compare qr eater equal unsigned 


ACOM.I 


Address compare less signed 


ACOMLU 


Address compare less unsigned 


ACOMNE 


Address compare not equal 


Eautaisnda 


ACOME2 


Address compare eoual zero 


acouoz 


Address compare or eater zero siqned 


ACOUGE2 


Address compare qr eater equal zero siqned 


ACOML2 


Address compare less zero siqned 


ACOM.LE2 


Address compare less equal zero siqned 


ACOM.NE2 


Address compare not equal zero 


ACOM.G 


Address compare greater siqned 


ACOM.C.U 


Address compare qreater unsigned 


ACOM.LE 


Address compare less equal skjned 


ACOM.LEU 


Address compare less equal unskjned 


AFIX 


Address fixed point arithmetic exception 


A NOP 


Address no operation 




ACOM.E2 rc 


*- ACOMANO.E rcrc 




ACOM.G2 rc 


<= ACOMLU rcrc 




ACOMGE2 rc 


<= ACOM.GE rcrc 




ACOM.L2 rc 


<= ACOML rcrc 




ACOMLE2 rc 


«= ACOM.GE.U rcrc 




ACOMNE2 rc 


ACOMAND.NE rc.rc 




ACOM.G rcrd 


-> ACOM.L rd.rc 




ACOMGU rem 


ACOMLU rd.rc 




ACOMLE rcrd 


-» ACOM.GE rd.rc 




ACOM.LEU rcrd 


-» ACOM.GE.U rd.rc 




AFIX 


ACOM.E 0.0 




ANOP 


«- ACOMNE 0.0 
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Addrr«t Compare 



Redundancies 



ACOM.E rd.rd 


« A fit* 




ACOM.NE rd.rd 


<=> A/VOP 




Selection 


class 


operation 1 


cond 


operand 


boolean 


COMAND COM 


E NE 




arithmetic 


COM 


I GE G if 


NONf U 




CO/W 


L GE G LE E NE 





Formar, 

ACOMop rd/c 

acomopfrcLrc) 
acomopzfrcd) 

31 24 23 18 17 12 11 6 3 0 

I A.MINOR I rd I rc I op I ACOM I 
5 6 f 6 

Description 

The content! of register* rd ami rc are fetched and the specified condition is calculated on 
these operands. If the specified condition is true, a fixed-point arithmetic exception is 
generated This instruction generates no general register results. 

Definition 

def AddressCornparefop.rd.rcl as 
d <- ftegfteadfrd. 128} 
c «- Regfteadfrc, 128) 
case op of 
ACOME: 

ACOM.NE: 

a «- d n c 
ACOMAMO.E: 

a «- (d and c) • 0 
ACOM-AND.NE: 

a |d and c) # 0 
ACOM.L 

a |rd » rc) 7 K < Of : |d < c| 
ACOM.GE: 

a «- |rd • rc| ? |c * 0) : |d 2 c| 
ACOMLU: 

a — |rd - rc| 7 (c > 0) : (|0 1 1 d) < |0 1 1 c|| 
ACOM.GC.U: 

a (rd « rc) 7 K * Of : R0 I I d| * |0 II c|| 

endcase 
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Aitdmt < :of>) Immediate 

Address Copy Immediate 



ITus operation produces one immediate value, placing the result in a genera) register. 
Operation cod« 



lA.COPV.1 


Address copy immediate 






A5tT 


Address set j 




Address zero | 




\ASETrd 


♦- ACOPY.I rd»-l 




\AZERO rd 


f- ACOPY.I rd»0 






Format 

ACOPY.I r<t*imm 



rd^copyifimmf 

31 24 23 18 17 0 

I A.COFVJ I rd I Imm 1 

8 6 18 

Description 

An immediate value is sign-extended f.om the 18-bit imm field. The result is placed into 
register rd 

Pcfinitfon 

del Atk)msCopy1nrvrwdiace(op/d>Tvn| as 

a «- (ffnmjj 0 II imm) 

RegWhcefrd. 128. a) 
enddef 

EaceflBaoi 

none 
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Address Immediate 

'nicse i>pcranons perform calculations with one general register value and one immediate 
value, placing the result in a general register. 



Operation cody 



AADD.I 


Mdress add immediate 


AADD.I.O 


Address add immediate signed check overflow 


AADD.I.U.O 


Address add immediate unsigned check overflow 


AAND.I 


Address and immediate 


ANAND.I 


Address not and immediate 


ANOR.I 


Address not or immediate 1 


AOR.I 


Address or immediate 


AXOR.I 


Address xor immediate 




AANDN.I 


Address and not immediate 


ACOPY 


Address copy ^ 


ANOT 


Address not 


AORN.I 


Address or not immediate 


AXNOR.I 


Address xnor immediate 1 




AANDNJ rd=rc.inm 


-* AAND.I rd*rc-imm J 


ACOPY rd=rc 


4- AOR.I rd^rc.0 


ANOT rd=rc 


*- ANOR.I rd=*rc.O 


AORNJ rd=rc.knm 


-+ AOR.I rd=rc-imm 


AXNORJ rd=rc.imm 


AXOR.I rd»rc-imm 


Redundance 


AADD.I rd=rcO 


<=> ACOPY rd^rc 


AADD.I.O rd=rc.O 


c=> ACOPY rd=rc 


AADD.I.U.O rd=rcO 


c* ACOPY rd^rc 


AAND.I rd=rc.O 


<=> AZERO rd \ 


AAND.I rd=rc-l 


» ACOPY rd^rc 


ANAND.I rd*rcO 


e=> ASETrd ~f 


ANAND.I rd=rc.-l 


« ANOT rd=rc 


AOR.I rd^rc-1 


<=> A5£T rd 


ANOR.I rd»rc-l 


<* AZftfO rd 


AXOR.I rd=rc.O 


ACOPKrdbirc 


AXOR.I rd=rc l 


<=> ANOT rd=rc 
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Add rets Immediate 



Sctaatao 



class 


operation 


check 


arithmetic 


ADD 


NONE O UO 


bitwise 


AND OR NAND NOR 
XOR 





Carnal 

op rdarcimm 
rd*op(rclmm| 

31 24 23 18 17 12 It Q 

I QP I rd 1 rc I Imm I 

8 6 6 12 

Bcaajfltton 

The contents of register rc is fetched, and a 64-bit immediate value is sign-extended from the 
12-bit imm field. The specified operation is performed on these operands. The result is 
placed into agister id 

Definition 

def AddrettJrnmedwKefop,rd^,4rTvr^ as 



I «- knmf f 1 1 k 


nm 


c Regfteadfrc 


64) 


case op of 




AANDJ: 




a «- C 


and i 


AO*J: 




a «- c 


or 1 


AMANO.h 




a c 


nand i 


ANOfU: 




a «- c 


nor 1 


AXOflJ: 




a «- c 


MOT C 


AAOOJ: 




a «- c 


♦ 1 


AA0OJ.O: 





t «- to 1 1 4 » to 1 1 I 

if t^4 * then 

raise FuedPointVitrvrietK 

end* 

* «- iw.-o 

AADOJU.O: 

* <- to h 4 » to • m 

to4 # 0 then 

raise FuedPo*ntAr«hmetK 

end* 

* *- <63 0 
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endcase 

RegWntefrd. 64. a) 

Eaccfiflom 

Kucd potnl tndimrtc 
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AJdivu Immrdutr RrvrrarU 



Address Immediate Reveled 

Hicsc upetmtmiu perform calculations with one general register value and one immediate 
value, placing the remit in a grncrJ register. 



Operation cprfo j 



AJETAND.EI 


Address set and equal immediate 1 


ASE+.AND.NE.I 


Address set and not equal immediate 


ASET.EJ 


Address set equal immediate 


AitY.GEJ 


Address set greater equal immediate signed 


XSET.LI ~ "'■ 


(Address set less immediate signed 


ASET.NE.I 


Address set not equal immediate 


ASET.GE.I.U 


Address set greater equal immediate unsigned 


ASET.U.U 


Address sec less immediate unsigned 


A5UB.I 


Address subtract immediate 


ASUB.I.O 


Address subtract immediate signed check overflow 


A.SUB.I.U.O 


Address subtract immediate unsigned check overflow 




ANEG 


Address negate 


ANEG.O 


Address neqate skjned check overflow 


ASET.GIU 


Address set or eater immediate unsiqned 


ASET.LE.I 


Address set less equal immediate skined 


ASET.LEJ.U 


Address set less equal immediate unsiqned 


ANEG rd*rc 


-» A.SUB.I rd»0.rc 


ANEG.O rdmrc 


ASUB.I.O rd-O.rc 


ASET.CH rdmlmmsc 


-» A.SET.GE.I rd«lmm* 1 jre 


73ET.GI.U rdmimmrc 


-» A.SET.GEJ.U rd-lmrml.rc 


ASET.LEJ rdmlmmrc 


-» A.SET.LI rd* Imnvl.rc ] 


ASET.LEJ.U rcMmnxrc 


ASET.LI.U rd«lmm-l.rc 
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Instruction Set 



ASET /NO.E.I rd-rc.0 




ASET rd 


ASFMND.NE.I rd-rcO 


«=> 


AZERO rd 


ASETAND.E.I 1 rd-rc-l 


«» 


ASET.E2 rd-rc 


ASETAND.NE.I rd-rc.-l 


c=> 


A5ET.NE2 rd~rc 


A5ET.E.I rd-rc.0 


c=> 


A5ETE2 rdmrc 


ASET.GE.I rdTf.O 


<=> 


A5ET.GE2 rdmrc 


ASETTi rd-rc.0 


<=> 


A5ET.L2 rdmrc 


A5ET.NE.I rd-rc.0 


<=> 


A5ET.NE2 rdmrc 


A5ET.GE.I.U rd-rc.0 


<=> 


A5ET.GEU2 rdmrc 


A5ET.LI.U rd-rc.0 


c» 


A5ET.LU2 rdmrc 



Setectfrn 



class 




cond 


form 


type 




arithmetic 


SUB 




1 




o 


boolean 


SET AND SET 
SET 


E NE 

I GE G LE 


1 
1 


s 





Format 

op rdmknmjc 

rd»op{imm f rcJ 
31 



24 23 



18 17 



OP 
8 



12 fl 



rd 



rc 



Imm 



12 



l>ie contents of renter rc is fetched, and a 64-bit immediate value ii si^n-cxtended from the 
12-hit ,mm ficU. The specified operation is performed on these operands. The result is 
placed into register rd. 

Definition 

def AMTculmrr^U*c\op.r6 t rzMr*Tl as 
I *- mvnfj 1 1 knm 

c ffegReadfrc, 64) 
case op of 
ASU8.J: 

a «- I • c 
ASUB.I.a 

* t*« # then 

raise FtaedPotntAnthmetk 

endif 
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AiMrru I mmrtliate KrvrnnJ 



ASUBIUO: 

< 063 I I 4 - K6J > I C| 

rf t*4 0 0 then 

raise FuedPotntAnthmrtx 

end* 

* I6J..0 
ASTT^NOEJ: 

a «- {(1 and c| - OJ* 4 
ASCT-AND.N6.I: 

a |(i and c) * 0)* 4 

ASCT.E.I: 

a 4- (I m c) 64 

ASn.N£J. 

a «- |i # cl 64 

ASC7X.I: 

a «- |» < c| 64 
ASCT.GE.I: 

a (• 2fc cj 64 
ASCT.LI.U: 

a 4- |(0 1 1 « < (0 II c|)* 4 
ASET.GC.J.U: 

a ((0 1 1 I) i (0 I I c|J 64 

endcase 

RcgWrrtefrd. 64. a| 
endrfef 

l ord po«nt tnrhmenc 
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AdditM ftrv»rwd 

Address Reversed 

llicse operation! perform c a] cd adorn with two general register values, placing the result in a 
general register. 



Operation codes. 



ASETAND.E 


Address set and equal zero 1 


ASETAND.NE 


Address set and not equal zero 1 


ASET.E 


Address set equal 




Address set greater equal siqned 


ASET.GE.U 


Address set qreater equal unsigned 


ASET.L 


Address set less siqned 


ASET.LU 


Address set less unsigned 


ASET.NE 


Address set not equal 


ASUB 


Address subtract 


ASUB.O 


Address subtract siqned check overflow 


ASUB.U.O 


Address subtract unsiqned check overflow 


EguiYalcncics 


as&tje* 


Address set equal zero 


A5£T.G.r 


Address set qreater zero siqned 


A5E7.GE.Z 


Address set qreater equal zero siqned 1 


AJET.I-? 


Address set less zero signed 


A5fT.i£7 


Address set less equal zero signed 


ASET.NE2 


Address set not equal zero 


A5ET.G 


Address set qreater signed j 


A5ET.C.U 


Address set qreater unsigned 


ASET.LE 


Address set less equal signed 


A5ETJ.E.U 


Address set less equal unsiqned 1 




ASET.E. Z rd-rc 


<- ASETAND.E rd-rcrc 1 


ASET.G.Z rdmrc 


<= ASET.LU rd-rc/c 1 


ASET.CE2 rdmrc 


<= ASET.GE rd>rc.rc 


ASE1.L2 rdmrc 


<= ASET.L rd»rc,rc 


ASET.LE2 rdmrc 


<= ASET.GE.U rd-rcrc 


ASEJ.NE2 rdmrc 


♦- ASETAND.NE rd-rcrc 


ASFT.C rdmrtxrc 


-> ASET.L rd»rcrt) j 


ASET.C.U rdmrUrc 


-♦ ASET.LU rd»rc.rb 


ASE7.LE rdmrkrc 


-* ASET.GE rd«rcrb 


ASET.LEU rdmrtxrc 


-» ASET.GE.U rd-rcrb 
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AiUre« Kevenett 


1 ASET.E rd 


=rc.rc 


» \SFT rd 




|A>FTNE rcforcrc 


« AZERO rd 


1 


Selection 






class 


Ofjrvation 


cond 


opcrana 


cneck 


arithmetic 


SU1 














NONf U 


O 


boolean 


SET AND SET 


E NE 








SET 


L GE G If 


NCWI U 






SET 


L CE C LE E NE 


2 




Format 










op rfcrbsc 








rcfeopfrb/c) 
rd=opz(rcb) 

*1 24 23 


m r 12 


11 6 5 


u 


I A. MINOR | 


rd | rc 1 


rb | 


op I 




8 


6 6 


6 


6 



rc «- rb 4- rcb 

Description 

Tnc contents of registers rc and rh arc fetched and the specified operation is performed < 
the*? operands. IV result is placed into register rd. 

Definition 

def AddressAevcrsedfc rd.rc.rt) «»j 

c 4- PegRead|rc 128} 

t «- KegReadfrb. !28f 
case op of 
ASFTf : 

a (b » c) 64 

ASETNE: 

a «- |b # c| 64 
ASfT/NO£ 

a ||b and c| • Of 64 
ASET-AND.NE: 

a |fb and cj n Of 64 
ASH » . 

a 4- ||rc ■ rb| ? p < 0) fb < cj) 64 

ASET.GE: 

a «- |(rc » rb| ? fb 2 0) fb 2 c)| 64 
ASET.LU: 
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a +- «rt r rfaf ? |b > 0) : (fO I I q < jo II cll^ 4 
AS£TG£.U. 

a «- «rc - rn| ? fb * 0) : ffO II b) * (0 1 1 cj) 64 

A SUB: 

a b • c 
Asueo 

I (b63 ■ » « • K63 I » CJ 

* *64 * t6ji then 

raije FucdPofntAnthmetic 

end* 

* «- *65 0 
ASUBUa 

t «- (0 1 1 1 bf - fO' lie) 

* (64 * 0 **n 

raise FuedPotntyVithmetx 

enow 

a t6 3 o 

endcase 

*egwntefr<t 64. a) 
enddef 

l uctl ptwil Arithmetic 
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Address Shift Left Immediate Add 

The*: operations perform calculations with rwn gcncraJ register values, placing the result in a 
general register. . 



QpgOtiBQ CQdg 

I ASHLIADD | Address shift left immediate add ~ 

Format 

ASHLIADO rd=rcrb.i 
rc*op(ra.rb.i) 

3] 2122 1*11 t? M 65 ?i o 

j A.MINOR } rd | rc | rb ^"^Ishj 
5 6 6 6 6 2 

assert 



Description 

The contents of register rb are shifted left by the immediate amount and added to the 
contents of register rc. The result is placed into register rd. 

De finition 

def Addr«iS^*/tlrftJmfnedi^eAdd{sn.rd.rt/to| as 
c #- fttgfteadfrc. 64) 
b RegRead|rtx 64; 

a c ♦ |b 6 2.jn o 1 1 O 1 **^ 
Kegwntefrd. 64, a) 
enddcf 

Exception} 

none 
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[A.SHU.SUB Address shift left immediate subtract 



Format 



ASHLI3UB rdxrbXrc 



rd*op(rbj.rcJ 

21 24 23 18 17 



I A. MINOR | rd | 



»2 II 65 21 0 



8 6 



rc 



6 



6 6 2 



assert 1^4 
sh i-1 



BcgfiBtiflO 

The contents of register rc is subtracted from the contents of register rb shifted left by the 
immediate amount The result is placed into register rd. 

Definition 

def AddressShrfttettmme<J^eSu as 
c *eg*eadfrc. 128) 
b RegReadfro. 128) 

(*62-sno l» 0'^ - c 
RcgWnttfr<t 64. *| 
enddef 
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Addrrtt Shifi Immnluie 



Address Shift fmmed^rp 



These operaoom perform cakulaoons with one general register value ami onr immediate 
value, placing the rcsuh tn a general register. 

Operation coda 



ASHLI 


Address shin left immediate 


ASHLIO 


>*wress shirt left immediate signed check overflow 


ASHL1.U.O 


Ad^ess shirt left immediate unsigned check overflow 


ASHRJ 


Address signed shift right immediate 


aIhrTu 


Address shift right immediate unsigned 


Redurvtergcj 


ASHLI rd=Mrcl 


<=» AADO rd»rcrc 


ASHU.O rd=rcl 


<=> A/OD.O rdsrerc 


ASHU.U.O rd-rc« 


c» AADD.U.O rd«rcrc 


ASHU rd=rc.O 


<=> ACOPY rdmrc 


ASWJ.O rd=rc.O 


<=> ACOPV «*rfc 


ASHLLU.O rcfcrc.0 


» ACOPY rd-rc 


A&HRJ rd=rcO 


<=> ACOPY rdmrc 


A5HRJ.U rd=rcO 


<=> a copy «fc/r 



SrieataQ 



class 




form 


operand 


check 


shift 


SHL 


1 












fcONEU 


O 




SHR 


1 


fmU 





Format 

op refcrcsimm 

rd*op(rcsimrn) 
31 



A.MINOR 
8 



24 73 



18 17 



12 II 



6S 



rd 



slmm | op | 



Description 

The contents of register rc b fetched, and a 6-bit immediate value is taken from the 6-bit 
stmm field The specified operation is performed on these operands. The result is placed into 
register rd 
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AddressShtfflmrnediacelQp^d^ as 
c «- KegKeadfrc. 64) 
case op of 
ASHU 

A.SML10: 

* <63 63-Wnm * cjjj^' then 

raise FuedPointAnthmeoc 

endif 

ASHUXJO. 

* *63 .64-*mt * 0 then 

raise Ftae*H**Vtthmet>c 

end/ 

* «- <63-s*nnU> « « 0*™ 
A.SHRJ: 

ASHRJ.U: 

a «- &*r™ | | c M jfcnm 

endcase 

flegWhtefrd. 64. ai 
enddef 

I'urd pomt arithmetic 
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Add ret* Ternary 

Address Ternary 

These operations perform calculations with three genera) register values, placing the result in 
a fourth general register. 

Operation grig 

lAMUX [Address mtMplzx | 

Format 

op rasretrerb 
rasamuxfrclscjt)) 

31 24 23 13 17 [21] 65 0 

I op I rd | rc | rb 1 ra | 
8 6 6 6 6 

Pexrfption 

The contents of registers rd, rc, and rb arc retched. The specified operation is performed on 
these operands. The result is placed into register ra. 



def AekJresTetrwytopsclrcrtxra| as 
d «- ftegOeadfrd. 64| 
c «- RegReadfrc. 64J 
b «- RegReadfrtx 64) 
endcase 
case op of 
AJUUX 

a 4- (c and 0} or (b and not d| 

endcase 

RegWhtefra. 64. a} 
enddef 

none 
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Branch 

This operation branches to t location specified by a register. 

Operation mda 

IB jBranch [ 

Format 

B rd 

31 24?) 18 17 12 11 6$ 0 

I B. MINOR 1 rd 1 0 | 0 1 B | 
8 6 6 6 4 

Pexripfoo 

Execution branches to the address specified by the contents of register rd 

Access disallowed exception occurs if the contents of register rd is not abgned on a quadkf 
boundary. 

DfifiOffifiQ 

def Branchfrd.rc./t*; as 

rf|fti»0|or|rt>itO| then 
raise ReservecMnstrucoon 

eodW 

d — KegRcadfrd. 64) 
|d|_o) * 0 then 

race Acce^sOisaaoweceyWiualMdress 

enoW 

ProoramCounter «- cJ^jj 1 1 O* 
ratse TafcenBranch 
enddef 

Exceptions 

Rncn-cd InMructKM 

Accru J wAwwd by virtual toldmt 
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Hr»nch tWk 

Branch Back 

This operation branches to a location specified by the previous contents of register 0, 
reduces the current privilege level. loads a value fum memory, and restores register 0 to the 
value saved «m a previous exception 



Operated coda 

I B, BACK 1 Branch back 

Format 



B.BACK 
bbacfcO 

31 24 23 18 17 12 11 6 5 0 

I B .MINOR | Q | Q | Q | B.BACK 1 

^ 8 6 6 6 6 ' 

Pacrlpdon 

Processor a hi text, including program counter and privilege level is restored from register 0, 
where it mas saved at the last exception. Exception state, if set, is cleared, re -enabling normal 
exception handEng, The contents of register 0 saved at the last exception is restored from 
memory. The privilege level is only lowered, so that this instruction need not be privileged. 

// the previous exception w*$ an AccessDetaii exception, Continuation State set at 
the time of the exception affects the operation of the next instruction after this 
Brmnch Bacm\ causing the previous AccessDetaii exception to be inhibited. If 
software is performing this instruction to short a sequence ending in mn AccessDctnJI 
exception, it should short by branching to an instruction that is not affected by 
Continuation State. 

QsLri&QQ 

def BtanchBtcktrdstrbl as 
c flegfteadfrc «28| 
* |rd * 0) orfrc 0 0| or |rb 0 0) then 
fane ResctvedJnstrucoon 

end* 

a *- lM<»AemorytLMCCptKnBaic.EACc& 
if Pnv*tj?levet > C|„o t**n 
PrMegeLevei c\_o 

ends] 

PToqratnCountet *- c 6 j 7 1 1 0* 
Estepbon&ute *- 0 
neg^tnafdJ2BJt 
raise Tj**n8r*nc/<ontinoe 
enddef 
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Bnodi Back 



tUtrrved laatiucttoa 
Acceat diaa fl o w td by vuum\ addrtM 
Accce* dteaflowtd by tie 
Accrndbaalovcd by global TB 
Access dnafiowrd by local TB 
.\cceat detail mpaxed by tag 
Access drtatl required by local TB 
Access detail required by global TH 
Ixxral I* mm 
Global mm 
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BrtiKh Harncr 

Branch Barrier 

This opetatmn snips the current thread jnril all pending stores are completed, then branches 
to a location specified by a register. 

Operation coda 

I B.BARRIER I Branch barrier 



Format 

BBARRIER rd 
btonicr(rdj 

31 2423 1817 1211 65 0 

I BMINOR I rd | 0 | Q 1 B.IIARIclE*l 
8 t 6 6 6 

Qqgtetiflo 

The instruction fetch unit is directed to cease execution until aO pending stores are 
completed. Following the barrier, any previously pre fetched instructions are discarded and 
execution branches to the a *drcss specified by the contents of register rd. 

Access disallowed exc prion occurs if the contents of register rd is no? aligned on a quadlct 
boundary. 

Sdf-modifying. dynamically-generated, or loaded code may require use of this instruction 
between storing the code into memory and executing the cede. 

Definition 

def Brancn8am«trd/crt3| as 
rffrc»0)orfrt>*0) then 
rate Reservedlnstruction 

end* 

d +- ReoReadfrd, 64) 
* Idj jbt # 0 then 

rate Acceu&sMlc**<3Qf*nualMdrni 



ProoramCourtter d6j 7 H 0* 
Fesrnearnerfl 
rate TafcenBraoch 
enddel 
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Bunch CondttmJ 



Branch Conditio nal 

These operations compare two operandi, and depending on the result of that comparison, 
conditkinaDy branches to a nearby code location. 



O peration codes 



B.AND.E 


Branch and equal zero 


BAND.NE 


Branch and not equal zero 


BE 


Branch equal 


B.GE 


Branch qreater equal signed 


Bl 


Branch jkjned less 


B.NE 


Branch not equal 


B.GE.U 


Branch qreater equal unsiqned 


B.LU 


Branch less unsiqned 






Branch equal zero 


B.G2* 


Branch qreater zero signed 


BGE2 1 


Branch qreater equal zero siqned 


B.L7* 


Branch less zero siqned 


BLE7* 


Branch less equal zero siqned 


B.NE2 


Branch not equal zero 


BLE 


Branch less equal siqned 


BG 


Branch qreater siqned 


BLEU 


Branch less equal unsiqned 


B.G.U 


Branch qreater unsiqned 


B.NOP 


Branch no operation 



*H.C.Z h encoded m ft I.U «vh borfi main* non field* nl and rc ctjuaL 
'ft i;H /. m encoded •• H.i«l'. wh*i bnrti miirucrvwt Arid* rd and re t^ul 
*H UX n enendrd m H I. wtrn born marnarftno firtdt nl and rc equal 
*HJJ-/. » encoded at H.GICU wtrh born tmirucnon field* rd and rc eo^al 
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Hrenrfc Condition^ 



RE 2 retarget 


«— 


Bj^NDE rcrctarget 


RG2 retarget 


c= 


B.LU rcrctarget 


RGE2 retarget 


c= 


B.GE rcrctarget i 


RLZ retarget 


c= 


aL rcrctarget 


RLE2 retarget 




aGE.U rcrctarget 1 


RNE2 retarget 


«- 


B-AND.NE rcrctarget 


RLE rerdLtarget 




aGE rd.rc target 


RO rerdtarget 


— > 


aL rd.rctarget 1 


RLE.U rerdtarget 




B.GE.U rd.rctarget 


RG.U rerdtarget 


— » 


B.LU rd.rctarget 


RNOP 


4- 


B.NE r0/0,$ 




B.E rc/ctargct 




ai target | 


aNE rcrctarget 




RNOP | 



dasj 


op 


compare 


type 


arithmetic 




L GE G LE 


NO* U 


vx zero 




L GE G LE E 

NE 


z 


bftwbe 


none AND 


E NE 





Format 



op rdsctarget 

If (opfrcUc)) goto target 

*J 34 23 tB >7 J2M q 

I °P I rd I rc I offset I 
a S S 12 1 

Demotion 

The contents of registers rd and rc are compared, at specified by the op field. If the remit of 
the comparison ii true, execution branches to the address specified by the ofTset field. 
Otherwise, execution continues at the next sequential instruction. 

DcftaMflQ 

def BrwhCond»ttonally<op.rd.rc.offset> as 
d 4- ftegfte*d(rd. 128) 
c #- ftegftead|rc, 128) 
cast op of 
EC: 
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a «- d > c 

8JJE: 

a d * c 

&ANO£: 

a «- (d and cj > 0 

j 4- |d and cj # 0 

ai: 

a «- (rd - re) ? fc < 0) |d < c| 

aG€: 

a*-frd-rc)?fciO|:|d*c) 

BXAJt 

^'♦-(rd.rcj ?|c>C|:BO It d)< fO || q| 
a |rd - rcj ? fc * 0»: 00 1 1 d) a (0 II ctl 

endcase 
if a then 

*ogramCounter frogramCounter ♦ foffsetft 1 1 offset 1 1 Q*\ 
raise Tokendranch 

end* 



Exceptions 



so 
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Branch Conditional Floating-Point 

These operations compare twit (taring-point operands, and depending on the result of that 
companson, conditionally branches to a nearby code location. 



Operation codn 


ae.F.16 


Branch equal floating-point half 


BE.F.32 


Branch equal floating-point single 


aE.F.64 


Branch equal floating-point double 


aEJ.128 


Branch equal floating-point quad 


aGE.F.I6 


Branch greater equal floating-point ha* 


aGE.F.32 


Branch greater equal floating-point single 


aGE.F.64 


Branch greater equal floating-point double 


aGE.F.128 


Branch greater equal floating-point quad ] 


aLF.I6 


Branch less floating-point half 


aLFJ2 


Branch less floating-point single 


aLF.64 


Branch less floating-point double 


aLF.128 


Branch less floating-point quad 


aLGJ.16 


Branch less greater floating-point half 


aLGJJ2 


Branch less greater floating-point single 


aLGJ.64 


Branch less greater floating-point double j 


aLOF.128 


Branch less greater floating-point quad 


EouiYaferKtes 


&LE.FI6 


Branch less equal floating-point hatf \ 


RLE.F.32 


Branch less equal floating-point single 


RLE.F.64 


Branch less equal floating-point double 


ALES. 128 


Branch less equal floating-point quad 


BLCiF.16 


Branch greater floating-point half 


RG.FJ2 


Branch greater floating-point single 


&G.F.64 


Branch greater floating-point double 


&G.F.I28 


Branch greater floaong-point quad J 




tXLEJMze rcsd.target 


B.GE.F.si*e rd,rctarget 


RCFsLte rcrd. target 


-> B.LF.size rd.rctargct 



Selector 



number format 


type 


compare 


size 


rtooung-potnt 


F 


E LG L GE G LE 


16 32 64 

128 | 
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Branch CoaefcaoMj Moaang ISmmI 



EflOMS 

op rd.rctargct 

if (opirOrcJ) goto target 

31 24 23 IS 17 I? I > Q 

I QP 1 rd I »c | off ft 

8 6 6 17 



Pcxrirton 

The concerns of registers re and rd are compared, a* specified by the tip field. If the retuh of 
the riimpariaon is true, execution branchea to the address specified by the offset field. 
Otherwise, execution continues Jt the neat sequential instruct* m 

PCTiofPon 

oef t>jnchCorKMorwMff ^JOnoPomtop.fttrcoffseiJ as 
case op of 

af J 16. aiGF 16. BXf.16. aG£/.l6: 

»re +- 16 
aC f 32. aiG F 32. aLF J2. a«/ 32: 

Mf 32 
B£F*4. aiG F 64. BXF 64, a« F 64: 
SUV 64 

8 C.F.I 28. aiGF I2& aLF.I2& aG£.F.I28: 
utt 128 

endcase 

d Ff«e.*eo*ead|rd, 12*5 
c «- F|a*eJ*eg**adfrc I28tf 
v «- fcomfdl cj 
case op of 

BCFI6. 8£F32. BCF64. BEFI2& 

4 «- N - Q 
81GFI6. BLCF32. 61CF64. BLGF 1 28: 

a«-fv.Uorfv.G) 
BLFI6. 6LF32. BIT 64. BIFJ28: 

a »- fv • g 

8GCF16. 8GCF32. BGCF64. BGCF128: 
a«-fv«Gjorfr*E| 

endcav 
# a then 

Proof amCounter «- Proof amCountcr ♦ (offset?? 1 1 offset 1 1 0 2 ) 
raise TafcenBranch 

end* 
enddet 
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Branch Conditional Visibility Floating-Point 

These operations compare two gnnip floating point operands, ami depending iki the mult 
of that companwm, conditionally branches m a nearby code locanon. 



Q pecaflap codes 



ai.F.32 


Branch invisible floatingpoint single 


aNI.FJ2 


Branch no< mvtsicite noting pom ingle 


aNV.F.32 


branch mt visible Aomrvj pcwx tingle 


av.F.32 


Branch visible floating point single 




number format 


type 


compare 


size 


floatino-pofnt 


F 


I NJ NV V 


32 



Format 

op rcrct target 

if (opfrcrd)) goto target 

31 24 23 16 17 12 M 0 

I QP I rd | rc | offset 1 

8 6 6 12 

QgasDDSiaa 

The contenu of registers rc and rd arc compared, at specified by the op field. If the result of 
the comparison is true, execution branches to the address specified by the offset field 
Otherwise execution continues at the .text sequential instruction. 

Each operand is assumed to represent a vertex of the form: |w x y x) packed into a single 
register. The comparisons check for .isibdity of a line connecting the venices against a 
standard viewing volume, defined by the planes: * = u ,x = w,y = w,y = w.x =0,z = 1 . A line is 
visible (V) if the vertices arc both within the volume. A hoc is not visible (NV) is either 
vertex is outside the volume - in such a case, the line may be partially visible. A line is 
invisible (I) if the vertices are both outside any face of the volume. A line is not invisible 
(NJ) if the vertices are not both outside - ny face of the volume. 

BcfinitiflQ 

def n|a| as fa.uONAN) or |a t-SNAN) enddef 
def tessjAty as fcomfa.bK enddef 

def trxyaAc.dl as ffcomffate(aj.bt«G| and |fcom(f*bs|<j.dhG| and |a.s-c s| enddef 
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feme* CoodMQMl Vo*** HnMf Pan 

def BrarxhCondiCftorwWts*^ * 
d *eg«eadfrd. l?q 
c 4- Regtfeadfrc. 
<te F|J2.dj| .ol 
c* 4- Ff32xj|. ol 

cy 4- F|32.c 6J J? | 

A 4- F|J2.d9$ 64) 

Cl 4- F|32.C9S *4» 
dw F(32.d» 2 7 . V6 ) 
cw 4- FJ32,C|77 f J 

n +- F|32.(U7fDOOOO0| // floating-point 1.0 

* fnfd*) or or n**} or n|d»f or n|u) or r*>f or r*j) or nfcv* tf«n 
ete 

ov 4- lessjfaosfr'Aj.dj) and tes*afaifo>t<U} and tessJ<fc.M) and fd*.-'* 
cv 4- ieu<f*6v4.cj| and teisjrab*tcyta| and msfu.fi) and 
tri 4- ftessjf i.<uj and tess0l.ua or ffd**0 and c«.lj 
tr 4- tmyfoxdi.cjtu) or tncyfoV.dz.cy.ci) or tr* 
case op of 
ai.F.32: 
a 4- tr 

aw; 32: 

a 4. not 9 
RWF32 

* *- nof foV and cv) 

RV.F.32: 

a 4- dv and cv 

endcase 

end* 
if a then 

Proof amCounter 4- FrograrKounter ♦ Joffsetffi 1 1 offset 1 1 0*) 
raise Takendranch 

enoW 
enddef 

Exception* 
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Branch Down 

*thi% operant m Uranchos to a Iih Jthm «pcc.'icd h> a rcgiMcr, rct'.ucin^ the cum-ni privilege 
level 

QgSUOQD codes 

1 a DOWN | Branch down \ 

EfiODfll 

aDOWN rd 
bdown(rd) 

31 24 ?J 18 17 17 11 6S 0 

1 B.MINOR I rd ] 0 | 0 | B.DOWN 1 
a 6 6 6 6 

Description 

Execution branches to the ftddrcu specified by the conm.ti t f remitter rd. The current 
pmilcgc level is reduced to the level sprcificd by the low order two bits of the c- tents of 
register rd 

Definition 

def 8rjncnDowTyrd.rt.rt* as 
rf|rc*0|ortrt>»G| then 
raise Rev^rwdirstrucoon 

end* 

d &*gRe*d(rd. 64) 
PnvUegeleve* > d| o then 
PrivOegelevel dj 0 

•ndrf 

r-rogramCoiriter d 6 j ; I I 0* 
tuise Uacodranch 
enooer 

R nerved !i*»liuclr»« 



Micnil'nity 



/cu% S\Mcm Architecture l uc. Aug !"\ |W° Instruction Srt 

Branch Gateway 

11m rpcratior provides a secure im-ins t» call a pmcedurc, inJuding those at a higher 
pnvilcgc level 

OocMtion codes 

jBGATE j Branch gateway | 



Equrvafenoc s 

IB GATE aCMl 0 



F ormat 
B GATt rb 



bgatefrbj 

31 2'*) 18 17 12 11 6S 0 

1 B. MINOR I 0 j 1 | rb 1 B GATE 1 
a 6 6 4 6 



Description 

"IV contents of register nS is a branch address in the high-order ft? hit* and a new privilege 
level .r rhe W order 2 tuts. A branch and link occurs to the branch address, a^d the 
pnvdege level is raised to th new pnvilcgr level. IV high-order 62 bits of the successor to 
the current pmgrw. counter is catenated with the 2-bn ci"rcnt execution privilege and 
placed tn register 0. 

If the new privilege level is greater than the current privilege level, an octlet of memory data 
is fetched from the address specified by register I, using th? bttle cndian hytc order and a 
gateway access type. A Gateway DuaJlourd exception *h urs if the original contents of 
raster 0 do not ctjual the memory data. 

It the new prvi!ege level i> the same a- the current pnvilcgc level no checking of register 1 
i> performed 

An Acccs^DisiJIoucd exception occurs if the new privilege le* el if greater than the privilege 
level required to wnte the memory data, or if the old pnvilcgc level is lo*cr than the 
pnvilege required to access the memory data as a gateway, or if tin acres* is not aligned on 
an S byte boundary. 

A Rcscrvedlnstruction rxecprion occurs if the rc field is n«n one «k the nl field is not xero. 

In the e: anplc below, a gateway from level 0 to level 2 is illustrated. THr gateway pointer, 
k«atcd by the contents of register rc (l\ is fetched from memory and compared against the 
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hrvnrh l'iatew»y 



conunts «»t register rb ^») IV* instruction nu> only complcu- it ilwsc values arc cuua). 
l.oncurrentry, the cimtcnii of register rb (U) is placed in the program counter and pnvilcgc 
level, «mi the address of the next sc^jocnnal aikircss and privilege level i> placed into register 
rd (0). Code at the target of tl* gateway Kicate* the data pointer at an offset from the 
gateway pointer (register 1), and fetches it into register 1, making a data tcgion available. A 
stack pointer may be saved and fetched using the data region, another region located from 
the data region, or a data region located as an offset tr *m the original gateway pointer. 



rS wJ x2 fl3 




Branch gateway 

For additional information on the branch-gateway instruction, sec the System and 
Privileged Library Cills tectum «>n page 

This instruction ghes the target procedure the assurances that register 0 contains a 
vali • return address and privilege level, that register I points to the gateway location, 
and that the gateway location is octlet aligned. Register I can then he used to 



107. 



MicroUniry 



Zeus System Architecture 



Tuc. Auk ,7 . » v « 



Instruction Set 



securely reach value* in memory. If no sharing of litem] pooh is desired, register I 
msy be used as s titers! pool pointer directly. If sharing of titers! pooh is desired, 
register I msy be used with an appropriate offset to losd a new literal pool pointer; 
for example, with a one csche line offset from the register /. No* thst because the 
virtus! memory system operstes with cache One granulsriry, dtjt severs! gateway 
locations must be crested together. 

Software must ensure that an attempt to use any octkt within the region deaignsted 
by rirtusl memory as gateway either funrdons properly or causes s legitimate 
exception. For example, if the sdfscent ocHets contain poiuters to literal pool 
locations, tofrware should ensure thst these metal pooh are not executable, or that 
by virtue of being aligned addressee, cannot raise the execution privilege level If 
register I h used dtrectfy as s titers! pool lo ca tio n , software must ensure thst the 
titers! pool locations thst are accessible ss s g*t*w%y do not hsd to a security 
violation, 

Register 0 contains s vahd return addres* and p+rrocg* Urei, the value h suitable for 
use directly m t*r Branch-down (B.DOWN) instruction to return to the gateway 
caJhe. 

Definition 

del aranchGattwayfrd.rt.rtl as 
c «- RegReadfrc 64) 
t> «- ftegfteadfrb. 64) 
4 |rd » 0| or frc * M then 
raoe fceservedtnsmjcton 

end* 

* *2-0 * 0 then 

raoe Acce^sDsa«ovved6yWtuai^dress 

endrf 

6 #- ProgramCountef 53 II PnviJe*}eteve< 
if rYMtegetevef < t>i o ^tn 

m «- Laac**efrwryGfc.c.64.q 

i 6 * m then 

raise Gateway Di^Howed 

end* 

F*r#vt»egeieve* •- Di o 

enoW 

ProgramCounter #- 2 1 \ 0 2 
RegMATteJnl 64. d| 
raise Takenfiranch 
enddef 

Rttcfvcd Instructkcm 

G»rrw*y tStuSowcd 

Acccw dnaflowed by vviu*l *Jdrru 

Accett AuBowvd by tag 

Awe*. a^attoweJ by j0ot»l 1*H 

Acccm dntliowed by fcoc*l TB 

Acer** drtal rr<p*ml by tag 
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Hrancn CUtrway 

Accr^ drtad mfmmd bt local 1* 
Acrw» 4ml wqnttd by ||ob«l Til 
UxalTKam 
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Branch tuk 

Branch Halt 

This operation stops the current thread unal sr. excepoun «curs. 

IBXALT j Branch h*t j 

fgffTWI 
3. HALT 
bhaftfl 

3j 24 73 t>t/ I2M 65 0 

| B. MINOR 1 0 | 0 | Q 1 B.HALT 1 
8 6 6 6 6 

llits instruction directs the instruction fetch unit to cease execution unni an exception 
Kcurs. 

Definition 

def 8ranchHalt|rd.rcrtaj as 

4 frd • OJ or frc - _ # or frt> # 0| Chen 
RescrvedJnstrucfcon 

end* 
FetcnHaltO 
rnddef 

Exceptions 

Ketcnrd tntiroctro 
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Branch Hint 

I'his opcraoon indicates a future bnncl. location specified by a register 

I MKT I Branch Hint | 

Format 

aHUSTT badctcounud 
bhint(t>acklcounud| 

*! 24 23 1817 1211 65 0 

I BMtNOR I rd I count 1 sjmrjj I B.HlrVTH 

• * 6 6 6 

»mm 4- badd-pc-4 

Dsgngfeo 

Tan instruction directs the imtruction fetch unit of the pmccssor that a branch is hkcly to 
occur aunt tunes at simm instructions following the current successor instruction to the 
address specified by the contents of register rd. 

After branrhmg count rimes, the instruction fetch unit should presume that the branch at 
•imm tturucooas foDouintf the current successor instruction is not likely to *>ccur. If count 
n aero, this runt directs the instruction fetch unit that the branch is likely to occur mr ire thin 
63 rimes. 

Access disallowed exception occurs if the contents of register rd is not aligned on a quadlct 
btntrKiary 

Prtnrfon 

Oef ararchrsv^dxoonuimfnl as 
d *- RecjReadfrd. 64] 
# fdl nj * 0 then 

raise Acce^sailc**e<JByVirtu*iAddress 

end* 

f«r#*ni(Pro9ranr<cKjnter #4 ♦ |0 1 1 simm 1 1 d 63 2 1 1 0* count) 
enddef 

Acctm dbuftowed by virtual addrttt 
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Branch Hint Immediate 

This rjperaoon indicates i future branch fatfation specified as an offset from the program 
ci winter. 

QBgateQ coda 

1 BMN1A ; Branch Hint Immediate ~" ~ I 



Format 

B.HINT.I bactt.count, target 
bhintifbackUount target) 

31 24 23 18 17 1211 0 

I 8.HINTJ 1 simm T count I offset | 

8 6 6 12 

»mm «- badd-pc-4 
BBCOOtta 

This instruction directs the :rwrrucrxm fetch unit of the processor that a branch is likely to 
«<cur count tunes at simm instructions following the current successor instruction to the 
address specified by the offset field 

After branching count cmcv the instruction fetch unit should presume that the branch at 
simm instructions following the current successor instruction is not likely to occur. If count 
is zero, this hint directs the instructs* m fetch unit that the branch is likely to occur more than 
63 nmcs. 

Pgfirwfon 

cfcf Branch! kntkrme<3% Mefim yn\xo^Kcifie% as 

aranchH^^ooramCountrr ♦ 4 ♦ (0 It umm I I 0 2 l count 
Progr^mCountrr • &hei]) 1 1 offset 1 1 0*|) 

enddcf 
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orancn immeuiate 




Thi$ opcram * branches to • location that it specified as an 
cininter. 


offset from the program 


Operation codes 




1 B 1 1 Branch immediate 


i 


Redundancies 


|ai target « B.E retarget | 


format 
BJ target 




bl{target) 

31 24 23 


0 


1 BJ I offset i 


a 24 1 



Pcxrtptfon 

Execution branches to the address specified by the offset field. 

Definition 

del BrarKNmmediatefolTset} as 

ProgramCouncer «- ProgramCounter ♦ (offjet)? 1 1 offset 1 1 0*) 

raise TakenBranch 
enddcf 

Exceptions 

none 
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feme* ImmetWr U* 

Branch Immediate Link 

Thu operation branches 10 s location that is specified as an effect from the program 
c Hjntcr. saving the vaJuc of the program counter in to register 0. 



1 b.LJNJU | Branch Immediate link | 

formal 

B.LIJS0CI target 
bfinkj(targct) 

31 24 23 0 
I B.UNK.I | offset 1 
8 — 24 1 

PcKripfon 

The address of the instruction following this one is placed into register 0. Execution 
branches to the address specified by the offset field. 

Definition 

def BrancNmmct*atrUnfc|orTjc<) as 

RegWritefO. 64. Prc^amCounter ♦ 4| 

ProgramCounter ProgramCounter ♦ foffsetJJ 1 1 offset 1 1 0*) 
raise laken&ranch 
enddet 

none 
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Branch Link 




"I "his i*fx tjn<»n branches t< 
counter into a register. 


» a location specified by a rcgistct, wing the value of the program 


Operation codes 




| B UNK 


| Branch link 1 


EaujVdlCQQCS 


&UNK 


B.UNK 0«0 | 


[BUNK rc 


a- B.UNK 0»rc ~j 



Format 

B.UNK rd-rc 

.1! * 4 p L SJLZ \iM 65 0 

I B.MINOR I rd 1 rc I 0 I B.LINK I 
8 6 6 6 6 

rb 0 

The address of the tnitruction following this one is placed inio register rd. Execution 
branches to the address specified by the contents of register rc. 

Access disallowed exception occurs if the contents of register rc is not aligned on a uuadJct 
boundary. 

Reserved instruction exception occurs if rb is not zero. 

DstkMOaa 

def BtanchUnkird.rcjVI as 
4 rb 0 0 then 

raise ffeservedlnstruction 

end* 

c «- RegPead|rc. 64) 
if K and 3) # 0 tnen 

raise Acc«sT>safk>wed8yVrrtua(Addrcis 

ends? 

ffegWfftefrd. 64. ProgramCounter ♦ 4J 
ProqramCountcr *- c^j 7 I I 0^ 
raise TakenBranch 
enddef 
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Rrten-ed Iiwtnicfaoa 

Acccm (fcttMowvd by rutual addfCM 
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l<oad 

Load 

l^iesc "pcntiims compute a virtual address from the contents of two registers, l<»ad data 
trom memory, sijm- or zcn»-cxrcndin)» the data n» fill the definition register. 



Operation codes 



L8 >0 


Load signed byte 


L16.B 


Load siqned doublet big-endian 


L16AB 


Load siqned doublet aHqned btq -endian 


LI6.L 


Load siqned doublet Httle-endian 


L.16AL 


Load siqned doublet aHqned Httle-endian 


L32.B 


Load siqned quadlet biq-endian 


L32AB 


Load siqned quadlet aHqned biq-endian 


L32.L 


Load siqned quadlet little-endian 


L32AL 


Load siqned quadlet aHqned Httle-endian 


L64.B 


Load siqned octlet biq-endian 


L64AB 


Load siqned octlet aliqned biq-endian 


L.64.L 


Load siqned octlet little-endian 


L64AL 


Load sianed octlet aliqned little-endian 


LI28.B" 


Load hexlet biq-endian 


LI28AB" 


Load hexJet aliqned biq-endian ! 


LI28.L" 


Load hexlet Httle-endian 


L I28AL M 


Load hexlet aliqned little-endian 


LU.8'* 


Load unsiqned byte 


LU.I6.B 


Load unsiqned doubK-t biq-endian 


LU.16AB 


Load unsiqned dot ,iet aliqned biq-endian 


LU.161 


Load unsigned doublet little-endian 


LU.I6AL 


Load unsiqned doublet aliqned little-endian 


LU.32.B 


Load unsigned quadlet biq-endian 


LUJ2AB 


Load unsiqned quadlet aliqned biq-endian 


LU.32.L 


Load unsiqned quadlet littJe-endian 


LUJ2AL 


Load unsiqned quadlet aliqned little-endian 


LU.64B 


Load unsigned octlet big-endian 


LU.64A8 


Load unsiqned octlet aliqned biq-endian 


LU.64.L 


Load unsiqned octlet little-endian 


LU.64AL 


Load unsiqned octlet aliqned little-endian 



H 1-8 need not dtftinguith between little endian and lag endun oulcnng. nor between aligned and unaligned, as 
only a single byte » loaded. 

'•|. 128 ft need not distinguish lietween ngned and unsigned, at the hexlet fills the detlination register 
t: l+\A.\h need not distinguish between signed and unsigned, a* fhe heslet fills the destination register 
M l. 128.1. need not disnnguish Ufween signed and unsigned, a* the heslrl fills the destination register. 
,4 M2*.AI. need not distinguish lietween signed »nd untuned, at the heslet filU the destination register 
l4 LU» need not disnnguish between tittle endian and btg endian ordenng. nor Iwtween aligned and unaligned, 
at onK a ssngle byte it loaded. 
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number format 


type 


size 


alignment 


o'derinq 


signed byte ? 


8 






unsigned byte 


U 


0 






siqned intcqer 




16 32 64 




L B 


siqncd integer aligned 




16 32 64 


A 


L B 


unsigned integer 


U 


16 32 64 




I B 


unsiqned integer aligned 


U 


16 32 64 


A 


L B 


register 




128 




L B 


register aligned 




128 


A 


L 8 



Format 

op rd=rcrb 

rd=op(rc,rb) 

31 24 23 18 17 12 11 65 0 

I L. MINOR | rd | rc | rb | np | 
6 6 6 6 6 

Pcxripfon 

An operand size, expressed in bytes, is specified by the instruction. A virtual address is 
c« imputed from the sum of the contents of register rc and the contents of register rb 
multiplied by operand size. The contents of memory* using the specified byte order arc read, 
treated as the size specified, zero-extended or sign-extended as specified, and placed into 
register rd. 

If alignment is specified, the computed virtual address must be aligned, that is, it must be an 
exact multiple of the size expressed in bytes. If the address is not aligned an "aco-ss 
disallowed by virtual address'* exception occurs. 

Definition 

def Loadfop.rd.rt.rb) as 
case op of 

L16L U2L LB. L16AL L32AL. L16B. L32B. LI6AB. l32Aa 
164L L64AL L64B. L64AB: 
signed «- true 

LU16U IU32L lite. LU16AL LU32AL LUI6B. LU32B. LU16AB, LU32AB. 
LU64L LU64AL LU64B. LU64AB: 

signed «- false 
L128L L128AL 11288. U28AB: 

signed «- undefined 

endcase 
cave op of 
IB. LU8 

sue 4- 8 

U6L, LU16L LI6AL LU16AL. L16a LU16BI L16AB, LJI6Aa 
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L32L IU32L. 132AL. LU32AL U2B. LU32a L32Aa LU32Aft 
sue 4- 32 

L64L IU64L L64AL LU64AL L64E LU648. L64A6L LU64/A 

sire 64 
D28L II28AL 1128a LI28AR 

sue «- 128 

endcase 

isue «- logfwej 

case op of 

U6L LU16L L32L LU32L L64L, IU64L U28L 
LI6AL LU16AL L32AL LU32AL L64AL LU64AL. LI28AL 
oroer L 

LI 6a LU16a L32a LU32a L64a LU64a LI 28a 
H6Aa LUI6^a L32Aa LU32Aa L64Aa LU64Aa LI28A& 
order «- B 

La ma 

order undefined 

endcase 

c +- RegReadtre, 64) 

b — RegReadfrtx 64) 

WtAddr c ♦ ft^jije 0 I ■ O^**- 3 ) 

case op of 

LI6AL LUI6AL L32AL LU32AL L64AL LU64AL. 1128AL 
Lt6Aa LUI6A& L32A&. LU32Aa L64A8. LU64Aa L128A& 
* JCtoe-4 J) * 0 then 

raise ^cessDisaflowed6yV1rtualAddress 

end* 

L16L. LUI6L L32L LU32L L64L LU64U LI28L 
LI 6a LUI6a L32a LU32B. LMa LUMa LI 28a 
L8. LU8 
endcase 

m «- LoadMenxxyKV»ftAddrjue.orderJ 
* «- l rT H«e-l and signed) 128 - 11 " 1 1 m 
RegWntefrd. 128. af 
enddef 

Exceptions 

Acer* dnaUowed by virtual nldrfN 
Acccat duattowvd by tag 
Accea. by global IB 

Accra* dna&crwvd by local TH 
Accew drtari rtqtared by lag 
Accrt« drtad required by local TH 
Acer* detail required by global ID 
l^cal TBoitw 
Global THnuM 
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Load Immediate 

"iliese operations compute a virtual address fn*m the content* ol" a register and a si|m- 
extended immediate value, load data from memory, si>m- o. /cn ►-extending the data to fill 
the destination register. 



Operation codes 



LL8' 6 


Load immediate signed byte 


U.I6AB 


Load immediate signed doublet aligned big-endian 


LI. 16 8 


Load immediate signed doublet big-endian 


LU6AL 


Load immediate signed doublet aligned frtde-enclan 


LI. 161 


Load immediate signed doublet finfe-endlan 


L.I.32AB 


Load immediate signed quadiet aligned big-endian 


Ll.32.8 


Load immediate signed quadiet big-endian 


U.32AL 


Load immediate signed quadiet aligned little-endian 


LI.32.L 


Load immediate signed quadiet little-endian 


LI.64AB 


Load immediate signed octJet afegned big-endia.i 


LI.64.B 


Load immediate signed octlet big-endian 1 


LI.64AL 


Load immediate signed octtet aagned fttfe-endian 


Ll.64.1 


Load immediate signed octlet ittle-endian 


U.128AB" 


Load immediate hexiet aligned big-endian 


LI.I2B.B" 


Load iinmediate hexiet oig-endian 


LI.128AL 1 * 


Load immediate hexiet aligned fttle-endian 


L.I.I28X" 


Load immediate hexiet Irtrte-endian 


LI.U.8" 


Load immediate unsigned byte 


LI.U.I6AB 


Load immediate unsigned doublet aligned big-endian 


LIU 16.8 


Load immediate unsigned doublet big-endian 


LI.U.I6AL 


Load immediate unsigned doublet aligned little-endian 


LI.U.I6.L 


Load immediate unsigned doublet little-endian 


U.U.32AB 


Load immediate unsigned quadiet aligned big-endian 


LI.U.32.B 


Load immediate unsigned quadiet big-endian 


LI.UJ2AL 


Load immediate unsigned quadiet aligned litrJe-endian 


LI.U.32.L 


Load immed .te unsigned quadiet little-endian 


LI.U.64AB 


Load immediate unsigned octlet aligned big-endian 


LI.U.64.B 


Load immediate unsigned octlet big-endian 


LI.U.64AL 


Load immediate unsigned octlet aligned little-endian 


U.U.64.L 


Load immediate unsigned octlet hrte-endian 



*U.8 need not distinguish brnmn bttle endian and litg endtan ordering, nor between aligned and unaligned, 
as onh a tingle bjte is Loaded. 

,7 I.I 12H.AB nerd not distinguish between sip*ed and unsigned, a* the healet fills the desnnation register. 
"I.I.I2H.B need not distinguish between signed and unsigned, »s the hexiet fills the destination register 
'"UlStLsl. need not distinguish berwren signed and unsigned, at the healet tills the destination register. 
• V IJ.I2H I. need not disnnguish between signed and unsigned. M the healei fill* the detftnanon register. 
: ' l.l.im need not distinguish between bttle endian and big endian ordenng. nor between aligned and unaligned, 
as only a tu.jfc byte n loaded 
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number format 


type 


size 


aUqnment 


orderinq 


«gned byte 




8 






unsigned byte 


U 


8 






signed tnteqer 




16 32 64 




L B 


signed integer aligned 




16 32 64 


A 


L B 


jnsigned integer 


u 


16 32 64 




L B 


unsigned integer aligned 


u 


16 32 64 


A 


L B 


register 




128 




L B 


register aligned 




128 


A 


L B 



format 

op rdsreoffset 
rd«op(rcoffsct) 

3J 24 23 18 17 12 It 0 

I °P I rd | rc I offset 1 

Pttcripfon 

An operand size, expressed in bytes, is specified by the instruction. A virtual address is 
computed from the sum of the contents of register rc and the sign-extended value of the 
offset field, multiplied by the operand state. The contents of memory using the specified byte 
order are read, treated as the size specified, zero-extended or sign-extended as specified, and 
placed into register rd. 

If alignment is specified, the computed virtual ad Ircss must be aligned, that is, it must be an 
exact multiple M the size expressed in bytes. If the address is not ahgncd an "access 
disallowed by virtual address" exception occurs. 

Definition 

def loadkrtmt&atetopjdstjorhri) as 
case op of 

UI6U U32L Ua UI6AL U32AL UI6B. U32B. UI6AB, UUAB 
U64L U64AU U64fl» U64AB: 

signed «- true 
UUI6C UU32L UU8. UUI6AL UU32AL 
UUI6a UU32B UU)6AS. UU32A& 
UU641. UU64AL UU64a UU64A8: 

signed false 
U\2BL UI28a U 1 2&AB: 

signed #- undefined 

endow 
case op of 
U8. UU8 

we 8 

UI6C UUIM, U16AL UUI6AL LII6B. IIU16B. LJI6AB. UUI6Aa 
we 16 
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ixmd Immcdisfc 



LI32L LIU32L LI32AL UU32AL LI32B. LIU32B. U32A8. L/U32AB: 
we 32 

11641. LIUMl. M64AL UUo4AL Li648rLIU64B. H64A8. LJJ64A3: 

Size 4- 64 
U\28L U128AL UI28B. LU?HA6: 

size 128 

endcase 

isi/e <- logjwej 

case op of 

LM6L LHJI6L LI32L UU321. U64L UU64L LI128L 
LII6AL LILM6AL U32AL LIU32A'.. LI64AL. LIU64AL U128AL 
order «- U 

Lit 6a UUI6B, U32B UU32B. U648. UU64a U128a 

LI16A8. LIUI6AB. U32Aa UU32AR U64Aa UU64Aa (U28AR: 

cro\rr «- B 
U8. UU8. 

order «- undefined 

endcase 

c «- RegRcad|rt. 64) 

WtAddr 4- c ♦ ' ffsetft-*"' 1 1 offset 1 1 O*" 2 ^ 3 ) 
case op of 

LM6AL UUIML U32AL UUi2AL LI64AL LIU64Ai, U128AL 
UI6Aa UU16A& N32AB. UU32Aa LJA4A8. UU64Aa Ut28A& 
* Kisi/e-4 0 * 0 then 

raise AccessOisaUowe^ByVirtualAddress 

eodif 

LU6L LIUI6L U^2L UU32L L'64L UU64L LM28L 
LI 1 68 UU168. U328. UU32B. LJ64a UU64B, UI28B 
U8. UU8 
endcase 

m LoadMemoryfc.WtAddr.stze.ordert 
a «- (rn^^j aod sigoedj 1 28- we j | m 
RegWntefrd. 128. a) 
enddef 

Exceptions 

. Veens diwdlcmrJ bv vim-tl fekl...** 
A cert* ih talk mnl b> tag 
Accr«« ilnaUawtd by global I H 

di%alk>wrd by kx*J I H 
\cce*» drfail required rn 
Aicr»« drtui rrtjiMrrJ by Ukm\ 111 
At c«* drful mjuirH I*)- gi*>b*l *!"h 
I steal I'M mm 
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